Gene Tery_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3131 
Symbol 
ID4244261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4787592 
End bp4788803 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content27% 
IMG OID638108141 
Producthypothetical protein 
Protein accessionYP_722734 
Protein GI113476673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.242131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.303319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACTTG GAATTTATTT TACTATAGCG TGCGCTTTTA TAATTATTAT ATTAGCAGTA 
TTGTTAACTC CTTTTCCACA TTGGCCTACT TATTATACTA TTTGGAAACA TATCTTCTTC
GATCCAGAGC CATCTATCAA ATTTAGACAT AGAAAAAGAC AAGTTTTTTA TTTACTTAAC
TATGCTTTAA AACTTCCTTT TGTCTCATTA TGTTGGTGTC TTGATGAAAT TTTATTCTCT
CAATATCGGG AAATGAAATT GTCTACACCT ATTTTTATAG TTGCTCAACC TCGTAGTGCC
ACAACATTTT TACATCGTAC TTTAGCTTCA GATGAAAAAA AATTTTTCTC TATTTGTTTA
TTAGAATGGC GTTATCCTTC TATATTAATA CAAAAATTTT TTCAAGCTAC AGGCTATTTA
GAAAAAATGA GTAAAGTCAG CTATTGGGGT AATAACAATG AAGCTAAATT AACTGAAAAA
ATGCACTATA GTTATCTTAA TGATTATGAA GATGATGGTT TTTTGTATGA AGAATGTTTT
TTCCATCAGT TTTATGTGAT TAACCGTTTT CCTTATCCTA AATTAGTAGA TAAACTCAAT
AATTTTCAGG AATTACCGAA CAAAACTAAA GAAAAAATGC TAAAAGCACA TTATCAAGTT
ATTCAAAAAA TTCTTTATTT ACGAGGTGGT AATTTAGTCT ATGTTTCTAA AGATAATGAA
TGTTTGCAAC GTTCGGAATT AATGAAAAAG CTTTATCCTG ATGCTTTATT TATAACTATT
ACCAGAGAAT CAGAAAAATT TATGAACTCT TATATTACTT TAATACATCA ATCAGCTTAT
TCTAAATCTA GAGTAGATGT TAATAATATT TCTGAATGGA TATCGATGCA AAGAAAAGTA
AGGGTAGAAG CAGCAAGTCA AACAATTAAT TTTTTTGAAA GCTTATCTGA AGAACAAAAA
CTGTGTTTTT CTTTTAATAA TTTAACTGAA AATATTAAAG ATTCTATTGA ATTAGTTTAT
CGTAAATTAA ATATTGATTT AACTGAAGCT CAAAGTGAAT ATTTGAAAAA CTTAGATACT
AAACAAAATT TAAGAGATTC TGCTTATAAA ACAGGTTCAG ATAAATTTGA GGAGTTTGCA
TTTTTCGATA AATTTGCAAC TGAAACTGCT AGGAATCATA AAACATTACT TGAAAAATCA
AAAAGAATTT AG
 
Protein sequence
MLLGIYFTIA CAFIIIILAV LLTPFPHWPT YYTIWKHIFF DPEPSIKFRH RKRQVFYLLN 
YALKLPFVSL CWCLDEILFS QYREMKLSTP IFIVAQPRSA TTFLHRTLAS DEKKFFSICL
LEWRYPSILI QKFFQATGYL EKMSKVSYWG NNNEAKLTEK MHYSYLNDYE DDGFLYEECF
FHQFYVINRF PYPKLVDKLN NFQELPNKTK EKMLKAHYQV IQKILYLRGG NLVYVSKDNE
CLQRSELMKK LYPDALFITI TRESEKFMNS YITLIHQSAY SKSRVDVNNI SEWISMQRKV
RVEAASQTIN FFESLSEEQK LCFSFNNLTE NIKDSIELVY RKLNIDLTEA QSEYLKNLDT
KQNLRDSAYK TGSDKFEEFA FFDKFATETA RNHKTLLEKS KRI