Gene SAG1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1007 
Symbol 
ID1013811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1017196 
End bp1018224 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content35% 
IMG OID637316191 
Productiron-compound ABC transporter, iron-compound-binding protein 
Protein accessionNP_688018 
Protein GI22537167 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4607] ABC-type enterochelin transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00224248 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA AACTTATTAT TGCTATATTA GCACTATGCA CTATCTTAAC CACTTCTCAA 
GCTGTTTTAG CTAAAGAAAA ATCACAAACT GTTACCATAA AAAACAACTA TTCGGTCTAT
ATTAAAAAAG AAAAAAGAGA CAAGCCGGAT AATAAAAAGC AAATCAGCGA GACACTTAAA
GTTCCTTTAA AACCCAAAAA AGTAGTTGTT TTTGATATGG GAGCTTTGGA TACTATCACA
GCTTTAGGAG CTGAAAAATC TGTTATTGGT ATCCCGAAGG CTAAAAATGC TCTAAGTTTA
TTGCCCAATA ACGTCAAATC TGTTTATAAA GCTAAGAGAT ACCAAGACGT AGGAAGTCTC
TTCGAACCAA ACTTTGAAGC TATTGCTCGT ATGCAACCTG ATGTGGTTTT CCTAGGAGCA
CGTATGGCTT CTGTTGATAA TATTGAAAAA TTAAAGGAGG CTGCACCTAA AGCAGCATTA
GTATATGCTG GAGTCGACTC AAAAAAAGTA TTTGACAAAG GAGTTGCTGA GCGTGTCACA
ATGTTAGGGA AAATCTTCGA CCAAAATAAA AAGGCAAAAA CCTTTAATAA AGATATCGCA
CAAGCTGTTC TTAAATTGCA GAAAACTATT GAGAAAAAAG GTAAACCTAC AGCTCTATTT
GTAATGGCAA ACAGCGGTGA ACTTTTAACT CAATCACCTT CTGGTCGTTT TGGTTGGATT
TTCTCTGTAG GTGGATTTAA AGCAGTCAAT GAAAATGAAA AACTAAGTTC ACATGGTACT
CCCGTATCTT ATGAATACAT CGCTGAAAAA AATCCTAACT ATCTCTTTGT TTTAGATCGT
GGAGCGACTA TTGGACAAGG AGCTTCATCA AAAGAACTTT TTAATAACGA TGTTATTAAA
GCAACTGATG CTGTCAAAAA CAAACGTGTT CATGAGGTAG ATGGAAAAGA TTGGTATATC
AATTCAGGCG GAAGCCGAGT AACACTCCGT ATGATTAAAG ATGTACAGAA CTTTGTTGAT
AATCGTTAA
 
Protein sequence
MTKKLIIAIL ALCTILTTSQ AVLAKEKSQT VTIKNNYSVY IKKEKRDKPD NKKQISETLK 
VPLKPKKVVV FDMGALDTIT ALGAEKSVIG IPKAKNALSL LPNNVKSVYK AKRYQDVGSL
FEPNFEAIAR MQPDVVFLGA RMASVDNIEK LKEAAPKAAL VYAGVDSKKV FDKGVAERVT
MLGKIFDQNK KAKTFNKDIA QAVLKLQKTI EKKGKPTALF VMANSGELLT QSPSGRFGWI
FSVGGFKAVN ENEKLSSHGT PVSYEYIAEK NPNYLFVLDR GATIGQGASS KELFNNDVIK
ATDAVKNKRV HEVDGKDWYI NSGGSRVTLR MIKDVQNFVD NR