Gene Slin_4568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4568 
Symbol 
ID8728332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5540057 
End bp5541397 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content55% 
IMG OID 
ProductPARP catalytic domain protein 
Protein accessionYP_003389346 
Protein GI284039416 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0566716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGC TAAATCAGTT GTTACAACGG GTGTGGGGGC AGTCTGCACC AGCCACCCTC 
CTTCAAACCT CCCGCCCTGT ATCGTCTACA GCCGAACCCG TTTTGGCTTC GCAGCCAATC
CGGCACCTGC GTACCATGAA GCTCATGATG GTAACGGCCG AGAACAACAA CAAATACTAT
GAAATGCGGG AGAAGGAAAA TGGCACCTTC GAAGCACACT ATGGCCGCGT AGGTGGTTTC
CGAAGCAAAT CGGTTCACCC AATGGCGCAG TGGGAACGGA AAGTCCGCGA GAAAAAATCG
AAGGGCTACA CCGATCAGAC CCACCTCTTT GCCGAAAGCC GACCTGAAAC AGATGCCAGT
ACCATTGCCA ATCCAGAGGT TCGAAACCTG ATGGCACGAC TGCTGGAATT AGCCAGACAG
TCCATCTTTC AGAACTATGT CGTTACGGCT CAGGAGGTAA CGCGTAAACA GGTCGAACAC
GCTCAGCAAC TGCTCGACGA GCTGGCCAAC CTTCTGGCGG TCAACATGGA TACAGCGGCC
TACAATGCCA AATTGCTGGA GCTGTTCAAG GTGATACCCC GCCGAATGAG CAAGGTAGGG
GAGCACTTGG CAGGCGCATC GCCCCAGTCG GCCGAGGAAC TACAACCCCT TCGCGACCAC
CTCGCCGAAG AGCAGTCTAC CCTCGATGTG ATGCGCGGTC AGGTTGAACT GGCCCCCGAA
TCAACCCCCG ATGACCAGCC CCAGCCAACG CTGCTGGACA GCCTGAACCT GGCTATCGAG
CCGGTGACCG ATGCGCATAT CATTACCCTG ATCAAGCGCA TGATGGGTAC CGATGCCGTC
AAATTCGACG CAGCGTTCAG TGTCCGCCAT ACGGCTACCG ATGCCGCTTT CGATGCCTAT
GTGCGTCAAC AGAAGAATCG AAAAACCATG GCGCTCTGGC ACGGAAGCCG CAGCGAGAAC
TGGCTGTCTA TTTTGAAAAC GGGGCTGTTG CTGCGTCCGG CCAATGCCGT TATTACGGGC
AAGATGTTTG GCTACGGCAT CTACTTCGCC GACCAGTTCA GCAAATCGCT CAACTACACC
TCACTGAACG GCTCGGTATG GGCCAACGGT CGGCAGTCGG AAGGGTATCT GGCTATTTAT
GAAGTACATG TCGGCGAACA ACTGGAGCTG ACAAAACACG AACCCAGCCA CATGCAACTG
GACATTAACG CCCTGAAGCA ACTTGACCCA CAGTATGATT CCGTATTTGC CCGGCAGGGG
GTCAGCCTGC AGAAGAACGA ATTCATTGTA TACAACCCGG CGCAGTGTAC GGTTCGGTAC
ATTGTCAAAA TCAAGGCTTA A
 
Protein sequence
MTMLNQLLQR VWGQSAPATL LQTSRPVSST AEPVLASQPI RHLRTMKLMM VTAENNNKYY 
EMREKENGTF EAHYGRVGGF RSKSVHPMAQ WERKVREKKS KGYTDQTHLF AESRPETDAS
TIANPEVRNL MARLLELARQ SIFQNYVVTA QEVTRKQVEH AQQLLDELAN LLAVNMDTAA
YNAKLLELFK VIPRRMSKVG EHLAGASPQS AEELQPLRDH LAEEQSTLDV MRGQVELAPE
STPDDQPQPT LLDSLNLAIE PVTDAHIITL IKRMMGTDAV KFDAAFSVRH TATDAAFDAY
VRQQKNRKTM ALWHGSRSEN WLSILKTGLL LRPANAVITG KMFGYGIYFA DQFSKSLNYT
SLNGSVWANG RQSEGYLAIY EVHVGEQLEL TKHEPSHMQL DINALKQLDP QYDSVFARQG
VSLQKNEFIV YNPAQCTVRY IVKIKA