Gene Tcr_0266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_0266 
Symbol 
ID3761436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp314195 
End bp315700 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content48% 
IMG OID637784971 
Productanthranilate synthase component I 
Protein accessionYP_390536 
Protein GI78484611 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATA CACATTTTGC CGCCTTATTT GAGCAGGGTT ATAAAACCGC GCCGGTAATG 
CGGACCATCT TGTCCGATTA TGATACGCCA TTGAGCGTTT ATCATAAGGT GGCGAATCAA
CCCCAAAGTT ATTTGTTTGA ATCGGTTCAA GGTGGTGATA AATGGGGGCG CTACTCCATT
ATTGGTTTGC CTTGTTCGAA ACGCTTAGTG ATTGAAGGCC AGCAAATCAC AGTATTAGAT
GGTGAGCAGG TAGTAGAGCA GCAAGTCTCT GAGGATCCAT TAGCCTGGAT TGAGGCCTTT
CAAGCTCGCT TTAAAGTCTT TCCCCAGCCA GGATTGCCCG CCTTTACCGG CGGTTTGGTG
GGGTATTTTG GTTACGACAC GATTCGCTAT GTCGAGAGTC GTTTACAAGC GTCAGAGCCG
GAAAAAGATG ATATTCACGC CCCGGATATT CAGCTGTTAG TGTCTGAAGA AATTGTTGTG
TTTGACAACC TCAGTGGCCA GGTTCACGTC ATTGTGCATG CGGATTTAAG CCAGAAAAAG
GCCTATTCAA AAGCCGAAGC ACGCGTGGCG GAGATTGCTG AAATGATTCA GCAGCCAATG
TCGGTTCCGA ATGACTTACC CAAAGAAACG ATTTTGACGG AACAGGATTT TGTCTCTAGC
TTTGGAGAGG AAGCCTTTAA ACAAGCCGTT GCCAAAATTA AAGAGTATAT TTTGGCAGGG
GATGCGATGC AGGTCGTGAT CTCACAGCAA ATGTCGGTGG ATTTTGATGC GGCACCAATC
GATCTCTATC GTGCATTACG TTATTTGAAC CCATCGCCTT ACATGTTCTT TTTAGACTTA
GGCGACTTAC AGATTGTCGG CTCCTCGCCT GAAATTCTGG TTCGCTTGGA AGACAATACC
GTCACGGTTC GCCCGATTGC CGGCACACGT CGCCGAGGTC TGACGCCGGA AAAAGATCAT
GCGTTGGAAG TGGATTTGTT ATCCGACCCG AAAGAGTTGG CAGAGCATTT GATGTTGATT
GATCTGGGTC GGAACGATGT AGGTCGTATT GCCAATGTCG GCTCAGTTGA ACTGACCGAA
AAAATGATTG TTGAACGCTA CTCTCATGTC ATGCACATAG TGTCGAACGT CAACGGTCAA
GCCAAACCGG GAATGTCTTC GATTGATGTG CTGCGCGCTA CTTTCCCAGC GGGAACGGTG
TCCGGTGCGC CGAAGATTCG AGCGATGGAA ATTATCGACG AATTAGAACC GGTTAAACGA
GGCGTGTATG CAGGCGCAGT GGGTTACCTC GGTTGGCATG GCAATATGGA TACCGCCATT
GCGATTCGAA CGGCGGTGAT TAAAGATGGA CGTTTATTTG TGCAAGCCGG CGCGGGTGTC
GTAGCCGACT CAGTACCGCA ATCAGAATGG GATGAAACCA TGAATAAAGG ACGCGCGATC
TTTAAAGCCG CTGAGTTTGT CACCAAAGGT CTCAAAACCG ATAACCCCGA TGCTGGGCAT
CGTTAA
 
Protein sequence
MNDTHFAALF EQGYKTAPVM RTILSDYDTP LSVYHKVANQ PQSYLFESVQ GGDKWGRYSI 
IGLPCSKRLV IEGQQITVLD GEQVVEQQVS EDPLAWIEAF QARFKVFPQP GLPAFTGGLV
GYFGYDTIRY VESRLQASEP EKDDIHAPDI QLLVSEEIVV FDNLSGQVHV IVHADLSQKK
AYSKAEARVA EIAEMIQQPM SVPNDLPKET ILTEQDFVSS FGEEAFKQAV AKIKEYILAG
DAMQVVISQQ MSVDFDAAPI DLYRALRYLN PSPYMFFLDL GDLQIVGSSP EILVRLEDNT
VTVRPIAGTR RRGLTPEKDH ALEVDLLSDP KELAEHLMLI DLGRNDVGRI ANVGSVELTE
KMIVERYSHV MHIVSNVNGQ AKPGMSSIDV LRATFPAGTV SGAPKIRAME IIDELEPVKR
GVYAGAVGYL GWHGNMDTAI AIRTAVIKDG RLFVQAGAGV VADSVPQSEW DETMNKGRAI
FKAAEFVTKG LKTDNPDAGH R