Gene Tery_1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1890 
Symbol 
ID4242695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2897004 
End bp2898995 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content43% 
IMG OID638107011 
Productintegrin, beta chain-like 
Protein accessionYP_721619 
Protein GI113475558 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA AAACTAAAAG CAGTATCACA AATATAGTTG ATCGCAAAGG AGGCTTAAAT 
CCAGATGTAG TAGATGTAAC TCTTGTACCT GGAGACAATG TGACCTTTGA TATCACTGCC
AAAGTTACTA AAAAAAGTTC CACAAAATTA CCTCTAGACC TAGTTTTTCT TAGCGATCTT
TCTGGTTCTT ATGGGGATGA CCTGCCAGTA TTACAGGATT TAGTTCCCAA GCTAGTTTCC
TCAGTTCGAG ACATCCAACC AAACAGTCAA TTCGGTCTAG CATCATATAT AGATAAACCC
AAAGATCCCT TTGGCGGCCC TAAAGATTTT GTGTATAGAA TGGAGTCAGC GATCACTAAA
TCTCGCACTG ATTTTCAGAA AGCGATGGAT GACTTGAAAA TTGGTAACGG TAATGACGGT
CCTGAGGCAC AGCTTGAGGC ATTGATGCAG TTAGCTCTCA GAGAAAAAGA GATAGGCTTC
CGTAAGAAAT CTCGGCGCGT TGTCGTTCTT TCTACAGATG CTAACTACCA CAAAGCAGGA
GACGGCAAAA AAGCAGGTAT CAAAACTCCC AATAATGGGG ATACAGTTCT TGATGGTAAG
CCAGCCGGTA CTGGGGAAGA CTATCCCAGT ATCGATCAAG TAAGAGATGC TTTACAAGAA
GCCGGCATTG TTCCCATTTT TGCAGTCACC GGCAACCAAG TCAGAAACTA CAAAAAACTC
GTAGATAAAT TGGGGTTTGG TACGGTAGAA AGGTTATCGC GGGACAGTTC TAACTTAGTT
AAGGTAGTAA CAGAGGGCTT GGAGGAAGTC TTTAGTGACT TAACGATAGT ACCCCAAAGC
GATGAGTTTG GTTACATTAA AAGTATTAAG CCTACGACTT ACGAAAATGT CCGCCCTGGG
CAAAGTCGGA CTTTTGAAGT TAAACTGGGT ATTACAGATC TCGATGCTAG CCAAAAAGAC
CGTCTTTCCC TGGAAGTATT GGGTTATGGG GAAACCAAAG TTAATGTTAC TCCTATTGTT
AACACCAAGC CCATCGCAAG CAATGACAAC CTCGCCACTA ATGCAGGAAG CAAATTGGTT
ATTAAACCAA AAGAGCTATT GGCAAACGAT ACGGATAAAG ACGGGGATAA GTTAAGTATT
AGCAAGGTAG GTAAAGCTTC AAATGGCAAA GTGATCCTGG GTAAAAATGG TAAGGTAACT
TTTACCCCCG ATAAAGACTT TACGGGCAAG GCCAGTTTTG AATATACCAT CGACGACGGT
AATAAAGGCC GTGATAGCGC GACTGTTACA GTTCAAGTCA GAGATAATTC TGACCCCATC
GCTAAAGACG ACAAGGTGTT TTTTGTCAGA CCCAAGCTCT TTCACGCTAT CCAAGCAAAA
AAGTTACTGA AAAATGATCA AGATAAAGAT GGCGACAAGT TGACTATTAT TAAAGTCAGT
AATGCAACCA AGGGCGAAGT AGAGTTAACT AAAAGCGGTG AAATAACTTT TACCCCTAGT
GGAAAGCATA AGAAGTTCAG CAAGGGGAGT TTTGAGTATA CTATCAGCGA CGGCAAAGGC
GGTACAGATA CAGCCAAAGT GATGCTGAAA AGAGTTGGGG ACTTGCCAAG CTCTAAGCGT
AGCGCTGGTT CTGAGAAGAG AGACTCCCTG ACTGGAAATA TAGATGACAA GGCACCAGGG
ATCCCTCTTG GTACAGTTGT TGATCCTCTC ACCCAAAGCA GTGATATTGG CTTCAAGAAT
GGAGGCAAAG TTGATCAAAA CGACTATTAC AACTTTGTTG TTCCGGAGCC TAGTTTCGTC
AGCATCAAAC TTGACGGTCT CAGGAGCAAC GCTAACCTAG AACTATACGA TAGCGACAAA
GTATCCCTTG ATAGTTCTAC TAACTCCGGC AATGCTCCTG AAGAGATTAA CACCTTCTTG
TTTCCCGATA CCTATGTGGT TGGCGTATTC GATCAAGGTA GTGGAACTCC TTACAACCTG
TCTATCTTAT AA
 
Protein sequence
MAKKTKSSIT NIVDRKGGLN PDVVDVTLVP GDNVTFDITA KVTKKSSTKL PLDLVFLSDL 
SGSYGDDLPV LQDLVPKLVS SVRDIQPNSQ FGLASYIDKP KDPFGGPKDF VYRMESAITK
SRTDFQKAMD DLKIGNGNDG PEAQLEALMQ LALREKEIGF RKKSRRVVVL STDANYHKAG
DGKKAGIKTP NNGDTVLDGK PAGTGEDYPS IDQVRDALQE AGIVPIFAVT GNQVRNYKKL
VDKLGFGTVE RLSRDSSNLV KVVTEGLEEV FSDLTIVPQS DEFGYIKSIK PTTYENVRPG
QSRTFEVKLG ITDLDASQKD RLSLEVLGYG ETKVNVTPIV NTKPIASNDN LATNAGSKLV
IKPKELLAND TDKDGDKLSI SKVGKASNGK VILGKNGKVT FTPDKDFTGK ASFEYTIDDG
NKGRDSATVT VQVRDNSDPI AKDDKVFFVR PKLFHAIQAK KLLKNDQDKD GDKLTIIKVS
NATKGEVELT KSGEITFTPS GKHKKFSKGS FEYTISDGKG GTDTAKVMLK RVGDLPSSKR
SAGSEKRDSL TGNIDDKAPG IPLGTVVDPL TQSSDIGFKN GGKVDQNDYY NFVVPEPSFV
SIKLDGLRSN ANLELYDSDK VSLDSSTNSG NAPEEINTFL FPDTYVVGVF DQGSGTPYNL
SIL