Gene Tery_1603 details


Gene Information

Locus tag: Tery_1603
Symbol:
ID: 4242986
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2450017
End bp: 2451279
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 40%
IMG OID: 638106745
Product: FolC bifunctional protein
Protein accession: YP_721355
Protein GI: 113475294
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
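
The start, end, gene length and protein length fields above are arithmetically linked. A minimal Python sketch of that check follows. It assumes the coordinates are 1-based and inclusive (which is consistent with the listed 1263 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 2450017
end_bp = 2451279


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length)                           # 1263
print(expected_protein_length(length))  # 420
```

Both results agree with the Gene Length and Protein Length fields of the record.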


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.119279
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATTCTA TAGACTCTTT CTTAAAACGC TTTGAACATT TCGGCGTTGA GCTTAGTCTC 
GAACGCATCT ATCAACTTCT AGATAATTTA GGCAAGCCTC ATCTCCAAGT ACCCATAATT
CATGTCGCCG GAACAAATGG CAAAGGTTCT GTCTGTGCTT ACCTATCATC CATCCTCAAA
GCAGCAGGAT ATAAAGTCGG ACGTTACACT TCACCCCACC TGGTTGACTG GACAGAACGT
ATTTGCCTCA ACGAAGAAAA AATTCCCTCA AGTATATTAG AACAATTATT GGTAGAAGTA
GAAACAGCTA TTCAACCAGA ACAACCTTCA CCTACTCAGT TTGAGGTTAT TACTGCCGCA
GCTTGGTTAT ATTTTGCTCA ACAACAAGTA GATGTGGCAG TTGTAGAAGT AGGTTTGGGG
GGCAGACTAG ATGCTACTAA TGTTTGTCAC CAACCAATGG TAACTGTCAT TACTTCTATT
AGTTGGGAAC ATTGGCAAAA ATTAGGTCCG ACTTTGAGTC ATATTGCTGG GGAAAAGGCG
GGTATTCTTA AACCAAGATG TCCTGCTATA GTTGGAGTAT TACCCCCAGA AGCAAAAAAA
GTAGTAGAAA AACGTATTAC TGATCTAGAG TGTCCTGTTG TTTGGGCAGA GCCTGCTAAA
TTATTGGCAG AAAGTCAGGA ATATAACTCA TTGCCAGTAG CTACTTATCA AGGTATTGAA
TATAAGGTGC CATTATTTGG AGATGTACAG TTGATAAATT CAGCGATCGC TATTGCTACA
GTTCAAGTGC TAGAAACACA AGGTTGGGAG ATTCCGATCA CAGCAATTAA ACAAGGAATA
GCTGAAACTA AATGGCTTGG ACGTTTGCAA TGGACTACAT GGCAAAATCG CCAATTATTA
ATTGATGGGG CCCATAATGC TGCTGCAGCC CAAGTATTGG GACGGTATGT CCGGACACTA
AATTATCCAT CTATAAGTTG GGTTATGGGT ATGCTGTCTA CGAAGGAACA TTCGGAAATT
TTTCGGGCAC TTCTTAGACC AGGCGATCGC TTGTATTTAG TTCCTGTTCC TGACCATAGT
TCGGCTGATT TGAATCAACT TAGTACCATA GCTCAGGATA TATGCACAAA ATTAGATTTT
TGTCAAGTTT ATCCCCATTT AACACCGGCT TTAGAAGAGG CTATTACTGA TAGTAAAAAT
TTAATTGTTT TATGTGGTTC TCTATATTTA TTGGGTCATT TTATCAAAAT GAAAAGTAAC
TAA
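
The GC content field in the gene information can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence. The helper name `gc_content` is hypothetical. Whitespace is stripped because the sequence is printed in blocks of ten bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First printed line of the gene sequence, spaces kept as shown in the record.
first_line = "GTGAATTCTA TAGACTCTTT CTTAAAACGC TTTGAACATT TCGGCGTTGA GCTTAGTCTC"
print(round(gc_content(first_line), 1))
```

Passing the complete gene sequence instead of the first line would give the value to compare against the GC content field.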
 
Protein sequence
MNSIDSFLKR FEHFGVELSL ERIYQLLDNL GKPHLQVPII HVAGTNGKGS VCAYLSSILK 
AAGYKVGRYT SPHLVDWTER ICLNEEKIPS SILEQLLVEV ETAIQPEQPS PTQFEVITAA
AWLYFAQQQV DVAVVEVGLG GRLDATNVCH QPMVTVITSI SWEHWQKLGP TLSHIAGEKA
GILKPRCPAI VGVLPPEAKK VVEKRITDLE CPVVWAEPAK LLAESQEYNS LPVATYQGIE
YKVPLFGDVQ LINSAIAIAT VQVLETQGWE IPITAIKQGI AETKWLGRLQ WTTWQNRQLL
IDGAHNAAAA QVLGRYVRTL NYPSISWVMG MLSTKEHSEI FRALLRPGDR LYLVPVPDHS
SADLNQLSTI AQDICTKLDF CQVYPHLTPA LEEAITDSKN LIVLCGSLYL LGHFIKMKSN
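
The protein sequence follows from the gene sequence through translation table 11. The sketch below illustrates this under two assumptions: the amino acid assignments of table 11 are the same as those of the standard code, and an alternative initiation codon such as GTG is read as methionine at the first position. `ALT_STARTS` is only an illustrative subset of the initiation codons that table 11 permits, and all names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: illustrative subset of the alternative start codons in table 11.
ALT_STARTS = {"GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in ALT_STARTS:
            amino_acid = "M"  # an initiation codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate_cds("GTGAATTCTA TAGACTCTTT CTTAAAACGC"))  # MNSIDSFLKR
```

The output matches the first ten residues of the protein sequence in the record, including the initial M encoded by the GTG start codon.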