Gene Tery_1341 details

Gene Information

Locus tag: Tery_1341
Symbol:
ID: 4242801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2042045
End bp: 2043895
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 32%
IMG OID: 638106518
Product: hypothetical protein
Protein accession: YP_721129
Protein GI: 113475068
COG category:
COG ID:
TIGRFAM ID:
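
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2042045
END_BP = 2043895
GENE_LENGTH_BP = 1851
PROTEIN_LENGTH_AA = 616


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1851 bp span corresponds to 617 codons, of which 616 encode amino acids and the last is the stop codon.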


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.100096
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGCAA AAAAAAAATC AGAAGTTAAT TTTATAATCC TCACAATTAT CACAATTCTA 
CTACCTGCTT TTTATGTACT TTTTATGATA TCAAAGGCGG GAGAATTACT AAATTTCGAT
TATTGGTGGA TGATTAAAAA TATCTATTCT ATAGATGGTT TCTCCACTAA TATTTTTGAC
TGGATCTTTC GGGCAAATGA ACATTTTGTC TTAATTCCTG CCATAATTTA TGCCCTGAAT
ATTGTTATTA CTAAAGGTTC CAATATTGGG TTATGTCTAA CTACATTTTT CCTAGCTTGT
GTTCAGGGAA TTTTATTAAA AATCTTAGTA CCTAATACTC TCAAAAAACA TCGTCCGATA
CTTTTCTTAC TAATTTTATT TATCTCAGTT TTTAACTTTA CCCCTGCTGC TGCTCATAAC
TGGATGCGCG GATATAGTGG AGTACATTGG GTAATTGCTA ATTTATTTGT CATTGCCTCA
ATTTTTTGCG TCAAAAAATT ACTAGAATCT CAACAAAATA GATTTGCTAT TACTAGTATA
ACCTTGGGAA TTTTAGGATG TATTAGTTAT AGTACCGCTC TAGGAATTTG GCCTATATTA
TGTGGAGTTG CCATTCTATA TAAATTGCCA AAAAAGTTGA CTTTCTCTTA TCTATTTTTT
TCTGTTTTAG TAATAGGTAT TTATTTTATC ACCTACACAA CACCCTCTCA TCATCCCTCA
TTATCTAAAC TGAATTTTCT TGATATAGTT ACTTACATTC CTATTTATTT AGGAGCAATT
TTTACTCATA ATATTTCCCT GGCTTTGGCA ATAGGTTGGG TAGGATTAGT TTTAGCAGGA
ATATTTTTAA TTTATTGGTT ATTTATAATT TATCCTCAAG ATTGGTTGCC CTGGTTATCA
ATAATAATTT ATACTTTGGG TACTGCTTTG ATGGCTGCTG TTAGTCGTTC TGGGTTTGGA
ATAGAACAAG CGATCGCTTC TCGTTACGGA ACCCTACCTG CTCTATTCTG GTTAAGTCTA
ATTATTCTGA TTTTTTTATG GTTAAAACAA CAACAATTTA CCCCAAGAAG ACAATGGTAT
TTTGTTGCTC CATTAGTGGC ACTTTTGACT ATTTTGATTA TATTAATGTA TCGAGTAGGT
ACAGAAACTT TTAAAGAAAT TGCTCATCGG GCAACTTTTC AACCTTTAGT AGCATTATCC
TTACAAATAG GAGTTTTAGA TCCGACTTTA ATTCAAGAGA AAGTTGGTAA CCGACCTGCT
GCTTTTTTAG GGTTAGTAGA TGCTTTGAAA TCTGATAGTT TAGTACCTTT TAATCGAGAT
ATAAAAAAGG ATAATTTTTG TGCTAATTTG GATGAGAAAA TTAATTCTAA TTTATTAACT
GGAAAACTGC CAGAAAATTG GCAGGGATAT TTTGATAATG TGACTAAATT TTCTCCAACT
ACAGCAAGAG TAAATGGATG GGTTAGTAAA GTTAAAAGTA AACTCCCCTC TTACTCTTCC
CAAGCTAGGA AGTCAGACCC ACTATTACCT CCTTCAAACT GGGAAGTAAA AAGTCAAGAG
AATGTTCAGA TTAAATGTAT TGCTATTTTG AATCAAGAAA ATGTAGTAAA AGGTTTTGGA
ATGTCTGGTT TTCCTCGTGC TGATGTAGCA AATTTATTAG GAGCAGAATA TGAATTTTCA
GGTTGGAAAG GATATATTGA GGTCAAAAGT GAAAAGTCAA AACTCGAAAG TCAAGAGAAT
GTTCAAATCT CAGCCAAGGA AATTCTAACA GCTTATGTTA AGTTGAAAAA TCGTCAAGAT
TGGATAGCTT TAACAAATAA ACATAGTTTT GATGGTGGTA GTGCATTATA A
 
Protein sequence
MLAKKKSEVN FIILTIITIL LPAFYVLFMI SKAGELLNFD YWWMIKNIYS IDGFSTNIFD 
WIFRANEHFV LIPAIIYALN IVITKGSNIG LCLTTFFLAC VQGILLKILV PNTLKKHRPI
LFLLILFISV FNFTPAAAHN WMRGYSGVHW VIANLFVIAS IFCVKKLLES QQNRFAITSI
TLGILGCISY STALGIWPIL CGVAILYKLP KKLTFSYLFF SVLVIGIYFI TYTTPSHHPS
LSKLNFLDIV TYIPIYLGAI FTHNISLALA IGWVGLVLAG IFLIYWLFII YPQDWLPWLS
IIIYTLGTAL MAAVSRSGFG IEQAIASRYG TLPALFWLSL IILIFLWLKQ QQFTPRRQWY
FVAPLVALLT ILIILMYRVG TETFKEIAHR ATFQPLVALS LQIGVLDPTL IQEKVGNRPA
AFLGLVDALK SDSLVPFNRD IKKDNFCANL DEKINSNLLT GKLPENWQGY FDNVTKFSPT
TARVNGWVSK VKSKLPSYSS QARKSDPLLP PSNWEVKSQE NVQIKCIAIL NQENVVKGFG
MSGFPRADVA NLLGAEYEFS GWKGYIEVKS EKSKLESQEN VQISAKEILT AYVKLKNRQD
WIALTNKHSF DGGSAL