Gene Tery_3399 details

Gene Information

Locus tag: Tery_3399
Symbol:
ID: 4244436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5205556
End bp: 5206647
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 36%
IMG OID: 638108383
Product: hypothetical protein
Protein accession: YP_722973
Protein GI: 113476912
COG category:
COG ID:
TIGRFAM ID:
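
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 5205556
END_BP = 5206647
GENE_LENGTH_BP = 1092
PROTEIN_LENGTH_AA = 363


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons print True for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```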


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.634125
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.211927
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAACTGA ATCAAAATCA TAAACCAGAT CAAGAATTAA ATGAAAGCCT TTTTCTGATA 
ACTGCTAAAC AAAACTGCAC TGATTTTGAT TTCTTAATTG ATTATCTGAA TAATCATCAA
GATGAAGTTG AGTCTGCTCT GAGAAATTAT GGTTCTGTGG CTATTAGGGG ATATAAAGTG
AAAACTCCTG AGCAGTTTCA AAAAGTGGGA TTGAGCATTT TTCCTGAGCT GCGAAATCAA
TATCCTGGTG GGGCACCTCG TTATAAAGTT GCTGAATATG TCTGGAGTGC TTCTTCAGTG
CCAAGTTATC AGTCAATTTG TGGTCATACA GAGTTATCTT ACTCGCCATC AGAACAACCT
CCTTACATAT TATTTTTTTC ACCTCAAGTT GCTAAAAGTG GGGGTGAAAC TCCTATTATT
AATATGAAAT CGGTTTTATC AGACTTACCT GAACAGTTGC AGCAGAAGTG TTCCCATACC
CGTTTGGTAA CAAAGTTTTA TTGGGTAAAT ACACAGAAAA GATTATTTGA CGTACGATTA
TGGAAATGGC CTTGGTTTGG TTTTCCTAAA TCTTGGAAGG CTGTCTTTAA TACTGAGGAT
AAAAGTTTAG TTGAAAAAAA ATGCTTTGAT GCAGGGCGAC AGATTAAGTG GCTTGCTAAT
GATGGCTTAA TTGCTCATTA TCCAATGCCA ATAATTGGTT CTCATCCGAT CACAAAGGAA
ATTGCCTGGA CAGGATTTTT CCCTTGGTTT CATATCTGGG GTGTTTGTAT TGATGCATGG
TTTGCTGCTA AGTATCAAAG AAAATTCAGG AGTTGGTTAG TATTTTTGAT CTTATTTTTA
ATCACGCTTG GGCAAATTTG TCTTGAAAAA TTGATTCCAG AAAGGTGCAA ATATCGAGCT
TTGGATGTAG TTTTTGAGGA TGGTTCTGAT TTGTCTTTTT GGGATGTTTA TCATATTGTC
AAAAGTTATT GGAAAAATAC AGAATTATTT TCCTGGCAGG AGGGAGATAT TGTAATTTTA
GATAACTATC GTATGGGGCA TGGTAGGCTA CCTTTTACTG GAGAAAGACA AGTTTATATA
GCCTTCTCCT AG
 
Protein sequence
MKLNQNHKPD QELNESLFLI TAKQNCTDFD FLIDYLNNHQ DEVESALRNY GSVAIRGYKV 
KTPEQFQKVG LSIFPELRNQ YPGGAPRYKV AEYVWSASSV PSYQSICGHT ELSYSPSEQP
PYILFFSPQV AKSGGETPII NMKSVLSDLP EQLQQKCSHT RLVTKFYWVN TQKRLFDVRL
WKWPWFGFPK SWKAVFNTED KSLVEKKCFD AGRQIKWLAN DGLIAHYPMP IIGSHPITKE
IAWTGFFPWF HIWGVCIDAW FAAKYQRKFR SWLVFLILFL ITLGQICLEK LIPERCKYRA
LDVVFEDGSD LSFWDVYHIV KSYWKNTELF SWQEGDIVIL DNYRMGHGRL PFTGERQVYI
AFS
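
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a rough sketch, not the database's own method. It assumes the codon-to-amino-acid assignments of translation table 11 (identical to the standard code) and reads the first codon as methionine, since table 11 allows alternative start codons such as GTG. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display and uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS that begins at its start codon (an assumption)."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # drop the terminal stop codon
    if residues:
        residues[0] = "M"  # the initiator codon is read as methionine
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate_cds("GTGAAACTGA ATCAAAATCA TAAACCAGAT"))  # MKLNQNHKPD
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence, or passing it to `gc_percent` and comparing with the reported GC content, gives a quick sanity check of the record.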