Gene Tery_2353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2353 
Symbol 
ID4245001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3635609 
End bp3637234 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content41% 
IMG OID638107446 
Producthypothetical protein 
Protein accessionYP_722046 
Protein GI113475985 
COG category[P] Inorganic ion transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0025] NhaP-type Na+/H+ and K+/H+ antiporters
[COG0589] Universal stress protein UspA and related nucleotide-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGAAA GTATTATTTG GATATTATTA ACTGGCTTTT TTGTCGGTGA AATTGCCTTG 
AGGTTAAAAG CACCACCCCT AATAGGAATG CTATTAGTGG GAATATTATT AGGTCCTCAA
ATTAGCAATA CTATCGACTC TAGCATTTTG GAAGCTGCCG ACTCTCTGCG CACCTTTGCT
GTAACAATAA TTTTAATGAA AGCAGGCTTA GGTCTAGACA GAGAAAAATT AGCACAACAA
GGTAGTGTAG CACTGCGCCT AGGTTTTCTC CCAGCAACCT GTGAAGCTAT TGTTATTGCC
CTAGCCGCTA TGTGGTTACT TAATTTTGAC TTTGCTACCG GGTTGTTATT AGGCTGTATC
ATCGGTGCCG AGTCTCCTGC AGTTATTGTT CCTGGAATGT TGCGGCTCAA AAGTTTGGGT
TGGGGCGTGA CTAAAGGTAT TCCTGATGCA ATTTTAACTG GTAGTGCTTT GTCAGATGTT
TTGCTGTTGT TAATTTTTAG TTTACTACTG GCATTTTTAT CCCAGGGAAC AACTATTGGC
ATAACTTTAC CTGGTGGCAT CACTATAAAC TTATTACAAA TATTACCCTT TCAGGTTACT
CTGCAAATAA TTTTGGGAGC GATCGCTGGG TTGCTGATGG CGCAAATATT AGTATTATTA
TTAGTCAAAC AAAATTGGAC TCAAAATTCT ACCCAAGATA CTTTAGTTGC TGGAAGCCTG
ACCCTCTTAT TAGTAGTCTT AGCAGAAAAA TTCCCCATAT TCTCTGGTTA TTTGGCAGTC
ATGGCAACAG GATTTTTTGT GCTCGAATTT GATCCTCCCC TGGGGCGACG CTTAAGAAAT
GGTTTTGAGA CCTTGTGGAC GATAGCGCAA ATAATTTTGT TTGTTTTGCT CGGTGCTAGT
ATCCCCTTGC AAGTCCTAGA AAATGTCTTC TTAGTGGGGT TATTAATTTT AGCTCTGGGT
ACTTTTGTCG GGCGGATGTT TGGTTGGTAT CTCTCGACCT TGGGCAGTAA TTGGAATTTG
CAAGAACGTC TATTTTTACT ACCTGGAAAT TCTGCAAAAG CAACTGTGCA GGCTGCTATT
GGTGCTATTC CTCTAGCGGT AGGCATTGAG GGAGGAGAAA CAATTTTAGC GATCGCCGCC
CTTTCCATAT TAGTCACAGC ACCTTTGGGA GCTTGGGCAA TACCTGCCTT AGCACCAAAA
CTGTTAGAAA AAGGAGAAGT CGATCCGACA AAAGTAGCGA TCGCTCGTCC TATTATATTA
TTAGCCGCCG TTGATACCTC ACCTCTTGCC GTTGATGTGT TATTAAAAGC TGCAGAACTG
GCGAGGCGTT GTGACGGGGA AATTGTAGTC TTGCACGTTC TTCAATTTGA ATACCGCAAC
AGTATTGAGC AACTGCGACA GCAAACTAGG CAACTTTTGG CAGATATTCG GCATCAGTTT
ATAACTGTTG CGGGTGATGC TTCAGAAGAA ATAATTTTTA TAGCGCAAGA GTATGGAGCA
GCAGAAATTG TCATAGGAAA AAAGGGGGAT CGTTTATGGG ACAATGTGTT AGTTGGGTCA
GTTTCTCAAG CAGTTTTAGA AAAGAGTTTA ATTCCTGTAG TGGTAGTTGA AAAAATAAAA
ACATAG
 
Protein sequence
MLESIIWILL TGFFVGEIAL RLKAPPLIGM LLVGILLGPQ ISNTIDSSIL EAADSLRTFA 
VTIILMKAGL GLDREKLAQQ GSVALRLGFL PATCEAIVIA LAAMWLLNFD FATGLLLGCI
IGAESPAVIV PGMLRLKSLG WGVTKGIPDA ILTGSALSDV LLLLIFSLLL AFLSQGTTIG
ITLPGGITIN LLQILPFQVT LQIILGAIAG LLMAQILVLL LVKQNWTQNS TQDTLVAGSL
TLLLVVLAEK FPIFSGYLAV MATGFFVLEF DPPLGRRLRN GFETLWTIAQ IILFVLLGAS
IPLQVLENVF LVGLLILALG TFVGRMFGWY LSTLGSNWNL QERLFLLPGN SAKATVQAAI
GAIPLAVGIE GGETILAIAA LSILVTAPLG AWAIPALAPK LLEKGEVDPT KVAIARPIIL
LAAVDTSPLA VDVLLKAAEL ARRCDGEIVV LHVLQFEYRN SIEQLRQQTR QLLADIRHQF
ITVAGDASEE IIFIAQEYGA AEIVIGKKGD RLWDNVLVGS VSQAVLEKSL IPVVVVEKIK
T