Gene Tery_0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0097 
Symbol 
ID4243038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp132007 
End bp133308 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content39% 
IMG OID638105455 
Productamino acid permease-associated region 
Protein accessionYP_720074 
Protein GI113474013 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00415721 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCTA CACCTCAACT TCGGCGAGAA ATTGGCGTGT TTGGCGCTAC CCTCATGGGT 
AATGGTTCCA TTCTTGGTAC CGGAGTATTT GTCAGTATTG GTATTGCTGC TAGCATTGCA
GGTCCGTCGG TAATTATTGC AGTAGCGGTA GCTGGTGTTG TTGCTACCTG CAATGCTTTT
AATAGTGCCC AGTTAGCGGC AAACCATCCC GTGAGTGGGG GTACTTATGA ATATGGTTAC
AAATATCTAA ATAATTGGTT GGGTTTTATT GCCGGATGGA TGTTTCTATT CGCTAAAAGT
GCTTCTGCTG CTACTGCTGC TTTAGGTTTT GCTGGTTATT TCCTTAATGC TTTTGGTGTG
AATAATAACA CTTGGTTAGT ATTGACTGCT TTAACTGCGG TGGTTGTATT AACTATAGTT
GTGTTAAGCG GAATTCGTCG TTCTAATGTT ACTAATATTA TTATTGTTTC TATTACCTTA
TTTTCTCTAG TTCTGTTTAT TCTGGCAGGG GTACCTCAAG TGGTCTTCTC TGAAGGGAAG
AATTTAATGC CTTTCTTTCC AGGGGATAAA CCTATAGCAT CTTTACTGCA AGCCACTGCT
TTAATGTTTG TCGCTTATAC TGGTTATGGT CGTATTGCCA CTTTGGGCGA GGAAGTAAAG
GAACCACGAC GTACTATTCC TAGAGCGATC GCTCTAACTA TGATTTTTAC CTGTGTATTA
TACATATCTA CCGCTATAGT TAGTGTATTT GCGGGTCAAG AAGTAATTCA AAAATTATCT
CAGGCAGATG TAAGTCTAGT TGCACCTTTA GAAGTTATTG CTAAAGTGGG AGGTTTGGGT
ATTCCTGTTA TTCCTAAGTT AATTGCTATT GGTGCTATTA CGGCAATGTT GGGAGTACTG
CTAAATCTGA TTTTGGGGTT GTCGCGGGTT TTGTTAGCAA TGGGGAGGCG ACGAGATGTA
CCTAAAGTTG TTGCTAGGTT GGATAGTGGG CAAACTACAC CTTATATAGC TGTAGTAATA
GTAGGAGTAG CGATCGCTTG TTTAGTTTTA ATTGGAGATG TGAAAACTAC TTGGTCTTTT
AGTGCTTTTA ATGTATTAAT TTATTATGCT ATTACTAATT TTGCTGCCCT CCACCTTTCT
CCAGAAGAAA GACTTTATCC TAAATGGTTA GGTTGGCTAG GTTTAGCTTC TTGTTTATTT
TTAGCTTTTT GGGTAGAAAA GCAAATTTGG TTAGTAGGTT TAGGATTAAT TCTTATTGGT
TTAATTTGGC ATAGTTTGAT TCATAGATTT ATAACTGATT AG
 
Protein sequence
MASTPQLRRE IGVFGATLMG NGSILGTGVF VSIGIAASIA GPSVIIAVAV AGVVATCNAF 
NSAQLAANHP VSGGTYEYGY KYLNNWLGFI AGWMFLFAKS ASAATAALGF AGYFLNAFGV
NNNTWLVLTA LTAVVVLTIV VLSGIRRSNV TNIIIVSITL FSLVLFILAG VPQVVFSEGK
NLMPFFPGDK PIASLLQATA LMFVAYTGYG RIATLGEEVK EPRRTIPRAI ALTMIFTCVL
YISTAIVSVF AGQEVIQKLS QADVSLVAPL EVIAKVGGLG IPVIPKLIAI GAITAMLGVL
LNLILGLSRV LLAMGRRRDV PKVVARLDSG QTTPYIAVVI VGVAIACLVL IGDVKTTWSF
SAFNVLIYYA ITNFAALHLS PEERLYPKWL GWLGLASCLF LAFWVEKQIW LVGLGLILIG
LIWHSLIHRF ITD