Gene Moth_1345 details

Gene Information

Locus tag: Moth_1345
Symbol:
ID: 3831903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1390704
End bp: 1392269
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 55%
IMG OID: 637829281
Product: amino acid permease-associated region
Protein accession: YP_430201
Protein GI: 83590192
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
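
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in Protein Length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1390704
end_bp = 1392269
gene_length_bp = 1566
protein_length_aa = 521

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the computed span and residue count agree with the Gene Length and Protein Length fields.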


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000295081
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACGG CAACAGAAAA ATTGGAGCTC CGCCGGGAGG TTACCGTGTG GGGCTCCTAC 
ATGTGGGGTT ACGCGGATGT CGGTGCCGAC ATTTATGCCG CCCTGGGACT GGTTATGCTC
TGGGCTAAAG GTGCTACCAG CCTGGCCTTC GCCTTGGCCG GTCTGGTTTA CATCATGATC
GGCCTGGCTT ATACCGAGCT TGCCGCCGCT TACCCCGTTG CCGGCGGTGG CCAGTTTTTT
ACCTTGCGCG GGCTGGGTGA TTTCTGGGGT TTTGTAGCCG GGGCAGCTTT GCTCTTGGAC
TACACCATAG ATATCGCCCT TTTTGCGACT GCTTCCGCTG GCTACATCAA CTTCTTTCTG
CCTTACCTTT TCGGGGTCAA TATTGACTCT CTGGCCGTCA GCATCGGTCC CCTGCACCAC
GTCAACCTGG TGTGGATGGC AGAGGCCCTG GCCCTGGTGT TCTTCCTTAT TGCCCTCAAT
ATTCGCGGTA TGAGGGAATC TTCTTTGCTC AATGAAGTCC TGGGTGCTAT TGATATCTTG
ACGGAATCAA CCATTATCGT CTTTGGGTTT CTCTTTGCCT GGCGGCCGGA ACTCCTGGCC
CATCAGTGGG TGACTCAGTT CCCGACTTTT AAGGAATTTG CCTATGGCTC CTCCCTGGCT
ATTATCTCCT TTGTCGGCCT GGAGTCTATC TCCCAGGCGG CCCAGGAAAC CAAACGGCCG
GCGACTGTCG TTCCCCGAAC TTCTGTCGGC CTGATCTTTA CTGTATTCAT TTTCGCCACC
GCCTTTTCTA CCATGAGCCT GGGAGTCCTA CCCTGGCAGG ATATCGCTAA AGCCGTCGGC
GATCCGGTGG CTACCCTGGC CCATGCCATC CCCTTTATTG GCATTATTGC CGGCCCTTTT
GCCGCCCTGC TGGGGGCTAC CATCCTGCTC ATTTCGGCCA ACTCCGGGGT CATGAGCGCT
TCCCGGTTGA CCTTTGCCAT GAGCCAGTTT AATTTTATCA GCGACTGGTT CAACGCCGTG
CACCCTCGTT ACCGGACTCC TTACCGCACT ATCCTTGTCT TCTCGGGCAT TGGTATCCTT
CAGTTAGTTC TTTCTTTCCT GACGCCCAAT GCTATGGATA CCCTGGGTAA CATGTATGCC
TTCGGGGCTA CCACAGGCTA TATCCTGGTT TTTATTGCCC TGATTAAACT ACGCTTCACC
GATCCCTATG CACCCCGGCC CTATAAAGTG CCTTTAAACA TAAAGATAAA TTATCGTGGC
CGGGTGGTGG AGTTCCCCAT CCTGGGGGTA ATCGGTACCC TGGGTATCAG CACTATACTC
TTCGAAGTCA TTCTTACCCA TGCCATTGGC CGTATCGCCG GGCCGGCGTG GATTATCCTC
TGTTTCCTCT ACTACGCTTA TTATCGCCGG AGCAAGGGCT ACCCCATTTT CGGCAACATC
CCCCGGGACT GGGAAGCCCA GCAAATAAAG GTCCTGGAAG CGGCCGAAGA GTACGACCTT
CTCGAGGAAT ACAAGCAGGC TCTTGCAGAA CGGGAACGCC TGGAGGCTAA GGCCCATGTC
AAGTGA
 
Protein sequence
MATATEKLEL RREVTVWGSY MWGYADVGAD IYAALGLVML WAKGATSLAF ALAGLVYIMI 
GLAYTELAAA YPVAGGGQFF TLRGLGDFWG FVAGAALLLD YTIDIALFAT ASAGYINFFL
PYLFGVNIDS LAVSIGPLHH VNLVWMAEAL ALVFFLIALN IRGMRESSLL NEVLGAIDIL
TESTIIVFGF LFAWRPELLA HQWVTQFPTF KEFAYGSSLA IISFVGLESI SQAAQETKRP
ATVVPRTSVG LIFTVFIFAT AFSTMSLGVL PWQDIAKAVG DPVATLAHAI PFIGIIAGPF
AALLGATILL ISANSGVMSA SRLTFAMSQF NFISDWFNAV HPRYRTPYRT ILVFSGIGIL
QLVLSFLTPN AMDTLGNMYA FGATTGYILV FIALIKLRFT DPYAPRPYKV PLNIKINYRG
RVVEFPILGV IGTLGISTIL FEVILTHAIG RIAGPAWIIL CFLYYAYYRR SKGYPIFGNI
PRDWEAQQIK VLEAAEEYDL LEEYKQALAE RERLEAKAHV K
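
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, the codon table is built from the amino acid assignments of NCBI translation table 11 in the usual TCAG ordering, and alternative start codons (which table 11 permits) are not handled because this gene starts with ATG.

```python
# Amino acid assignments for NCBI translation table 11, listed in
# TCAG order for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGGCCACGG CAACAGAAAA ATTGGAGCTC"))  # MATATEKLEL
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of the record.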