Gene Tery_4431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4431 
Symbol 
ID4246084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6829501 
End bp6831891 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content35% 
IMG OID638109314 
ProductFHA domain-containing protein 
Protein accessionYP_723891 
Protein GI113477830 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATA AAATAGGTCA AACAACCCTG TTAAGTAATA ATCCTTATAT AGAATTAAAT 
AATCAAGGCC AAATTATCAC CTTCCAACTG ACACAAAAGC GCCATATATT AGGACGCGAT
CGCCGCCGCG CCAATCTAGT AGTGCCCTCA AATTGGTTGC CCATCTCTAA TTATCATGCC
CTCATCAAGA AATCTGACGA TCGCTACCAT ATTTATGATG GTGACGGTTA TCGACCAAGT
ACAAACGGAT TATTTCTCAA CCAGACCCGC ATTACTCCTT CAGAAGGTCA TCCTTTGGAA
AATGGCATAG AAATCAAAAT CGGCCTAGAA CCCCAAAATC AAATCTTGCT CAAATACTTT
GAACCGAATA ATTCTACAGC TTTAACATCT TTACCAAAAA CGTCCTCAAT ATCTTTGCAA
AATAGATCAG TATTACTTGG TCGGGATCCT GACGCTACCC TAGAGTTGAA TGCTCCTATT
ATATCCCGTC ATCATGCCAC CATTGAGCCC AATGCTCAAG GTGAGTACGT TTTACAGGAC
TATAGTAGCA ACGGTGTGTT TGTTAATGGG GTCCGAGTTC AAAGTACTAT TGTTGTGACG
GAGGGAGCAG TAATTAAAAT CGGACCTTTT ACCCTTATTC GTCGTGGGGA TAACTTGGAA
GTCCTCGACC CCGGTAACGA AATTCGTCTT GATGCCTATC GCCTCTACAG AATAGTTAGA
GACCAACGGG GGAAAACACA AGTTTTACTA AATGATATCT CCCTTGCGAT CGAACCTGGG
CAATTTGTTG CTTTAGTCGG AGGAAGTGGA ACTGGTAAGT CGACTTTGAT GCGAACATTA
TTAGGAATAG ACCCGACAAC TAAAGGCCAA GTTTATATTA ATGGTGAAGA TTTAAGGAAT
AATTTTAATA TTTATCGGAC TCAGATAGGT TATGTTCCTC AAGATGATAT TATTCATCCC
GAATTAACAG TATTGGAAGC TTTAACTTAT ACTGCAAAAT TACGCTTACC TCCAGATACT
AATGTTCAAG AAGTAGTGGA AAAAACTATC GAACAAATAG AGATGGCAGA GAGAAAAAAT
ATATTAGTTA GTCAGTTAAG TGGAGGGCAA AGAAAAAGGG TAAGTATTGG CTTAGAATTA
TTAGCAGATC CCAAATTATT TTTCTTAGAT GAACCGACAT CTGGACTCGA CCCAGGATTA
GATAAAAAAA TGATGCAACT ATTAAAAAAG TTGGCAAATC AAGGACGAAC AATTATTTTA
GTTACTCATG CCACAGCTAA TATTAGAATT TGCGATCGCA TAGTTTTTTT AGGTAGAAGT
GGGCGTTTAT GTTACTTTGG TTCTCCGAAA GAAGCTATGA ATTTTTTCTC AATCAATACT
GGTGACTTTG CTGATATTTA CAATGAACTA GAAACCAGTG ATGAAAATAT TAATGAATGG
GTCAATAATT TTCGTCAATC AGAATATTAT CGGCGCTATA TTTCCAATCA TTTGAGTGTT
AATAACTTAA AACCTCCTAC TAACTTACCC CCCAAAAAAC AACTCCCTTC TTTTGCTAAA
CAACTATTTA TTTTAATTCA ACGATATTTA AAATTAATTT TCCGTGACCC GATTAATTTA
GGTTTATCTT TATTAACTGC TCCTATTGGT ATTGGTTTAA TTTTATTTGC AGTTAGAGAT
AAAAATCCAT TAATTGGCGA CCCAGAACCT ACTTTAGCCC CTTTAGCTTT GCGAGTATTA
TTTGTATTTA CTTGTGCCTG TTTATGGGTT GGTCTTTCGA GTTCTCTGCA AGAAATAGTC
AAAGAATCAG CTATTTATAT TCGAGAAAGA TTGGTAAATT TAGGATTATT TGCTTATCTA
GCTTCAAAAG TCATAGTTTT AGGATTATTA GCAATTTTTC AAACTTTATT AATAGCAGGA
GTAATTATCT TGGGATTTGA AAGCCCCCAG CCAGAAATAA TTTCATGGTC AATGGGAGTT
AGTATTACTA CTTTTTTAAC TTTAATAAGT TGTATTAGTA TGGGATTAAT GATTTCTTCT
ATTGTTAAAA ACGCTTCTCA AGCTAATAGT GCTTTACCCT TAATATTGCT ACCTCAGATT
ATTTTTTCTG GAGTTTTATT TGAGATGAAA GGTATAGCTA GTAAATTTTC TTGGGTAATG
TTAAGTCGTT GGTCTGTAGC TGCTTATGGG GCTCTAGTCA ATGTCAATAA AATGGTTCCA
GAAGCAACTA AATTACCAGA TGGTAGTAGG GTTTCTTTAC CATTTTCTGG TTCAGATATT
TATAACCTTA ACTGGGGAAA TTTAGGTTTA AGTTGGGGAG TTTTATGTTT ACATTCACTG
ATTTATTTAG GTTTAACCAT TTGGTTTAAA AAACAGAAAG ATATTATATA A
 
Protein sequence
MSNKIGQTTL LSNNPYIELN NQGQIITFQL TQKRHILGRD RRRANLVVPS NWLPISNYHA 
LIKKSDDRYH IYDGDGYRPS TNGLFLNQTR ITPSEGHPLE NGIEIKIGLE PQNQILLKYF
EPNNSTALTS LPKTSSISLQ NRSVLLGRDP DATLELNAPI ISRHHATIEP NAQGEYVLQD
YSSNGVFVNG VRVQSTIVVT EGAVIKIGPF TLIRRGDNLE VLDPGNEIRL DAYRLYRIVR
DQRGKTQVLL NDISLAIEPG QFVALVGGSG TGKSTLMRTL LGIDPTTKGQ VYINGEDLRN
NFNIYRTQIG YVPQDDIIHP ELTVLEALTY TAKLRLPPDT NVQEVVEKTI EQIEMAERKN
ILVSQLSGGQ RKRVSIGLEL LADPKLFFLD EPTSGLDPGL DKKMMQLLKK LANQGRTIIL
VTHATANIRI CDRIVFLGRS GRLCYFGSPK EAMNFFSINT GDFADIYNEL ETSDENINEW
VNNFRQSEYY RRYISNHLSV NNLKPPTNLP PKKQLPSFAK QLFILIQRYL KLIFRDPINL
GLSLLTAPIG IGLILFAVRD KNPLIGDPEP TLAPLALRVL FVFTCACLWV GLSSSLQEIV
KESAIYIRER LVNLGLFAYL ASKVIVLGLL AIFQTLLIAG VIILGFESPQ PEIISWSMGV
SITTFLTLIS CISMGLMISS IVKNASQANS ALPLILLPQI IFSGVLFEMK GIASKFSWVM
LSRWSVAAYG ALVNVNKMVP EATKLPDGSR VSLPFSGSDI YNLNWGNLGL SWGVLCLHSL
IYLGLTIWFK KQKDII