Gene Amir_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2101 
Symbol 
ID8326290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2322692 
End bp2324194 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content71% 
IMG OID644942651 
ProductNCS1 nucleoside transporter family 
Protein accessionYP_003099892 
Protein GI256376232 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCCCTCG ACCCCTCGGC GCAGATCGCG TCCGACCAGG AAGGGCGCGT GGACCTCGTG 
GACCGCGCCG CCATCGCCGA CAGCCCGTAC TACAACCCGG AACTGGCCCC GGTGCCGCTG
GAGGGGCGCA CCTGGACCAC CTACAACTTC TTCGCGCTGT GGATGGGCAT GGCGCACAAC
ATCCCCAGCT ACACGCTCGC CGCGTCGCTG GTGGCGCTGG GCATGGACTG GGTGCAGGCG
CTGCTCACCA TCACGCTCGG GAACCTGATC GTGCTCGCGC CGATGCTGCT CAACAGCCAC
GCGGGCACCA AGTACGGCAT CCCGTTCCCG GTGTTCGCCC GCGCGTTCTA CGGGGTGCGC
GGGGCGAACC TGGTGGCGCT GCTGCGGGCG TTCGTGGCGT GCGCGTGGTT CGGCATCCAG
ACCTGGGTGG GCGGCAAGGC GCTGCACGTG ATCGTCGGGC GGCTGGCCGG TGAGGCGTGG
ACCGGCGCGC CGGTGGTGCT GGGCCAGGTG TGGACGCTGT GGCTGTGCTT CCTGGTGTTC
TGGGCGGTGC AGATGCTGGT GATCTGGCGG GGCATGGAGG CGATCCGGCG GTTCGAGAAC
TGGACGGCGC CGCTGGTGTC GGTCGGGTTC CTGGTGCTGC TGGCGTACGT GGCGGTGAAG
GCGGGCGGGT TCGGGCCGAT CCTGTCCGAG CCGTCGAAGC TGGGCTGGGG CCCGGACTTC
TGGAAGGTGT TCTTCCCCGC GCTGATGGGG ATGATCGCGT TCTGGTCGAC GCTGTCGCTG
AACATGCCGG ACTTCACCCG GTTCGGCGGC AGCCAGCGCA AGCAGGCGCT CGGGCAGGTG
CTGGGGCTGC CGACGACGAT GACGTTCATC GCGGTGGTGG CGATCCTGAC CACCTCGGGC
GCGCAAGCCC TGTACGGCGA GGCGATCTGG GACCCGGCGG AGCTGGCGAG CCGGTTCGAC
AGCACGGCGG TGGTGCTGGT CGCGCTGGTG TCGCTGGTGC TGGCGACGGT GTCGGCGAAC
CTGGCGGCGA ACGTGGTCAG CCCGTCGTAC GACTTCTCCA ACGCGTTCCC GCGCCGGATC
AGCTTCGCCG TGGGCGGGTT GATCACGGGT GTGCTCGGCG TGCTGATCCA GCCGTGGCGG
CTGATCTCCG ACCCGGGCAT CTACATCTTC GCGTGGCTCG GGTTCTACGG CGGGTTGCTG
GCGTCCGTGG CCGGGGTGCT CGTCGCCGGG TACTGGGTGC TGGCGCGGAC CCGCCTGGAG
CTGCCCGACC TGTACCTGTC CGGGCGCGGG GCCTACTGGT TCACCGGCGG CTGGAACTGG
CGGGCGGTGG TGGCGACCGC GCTCGGGTCG CTGCTGGCCG TCGGCGGCGC CCACGGCGGG
CCGTTCCCCG CGGACGGGCT GATCCCGCCG CTCAAACCGC TCTACGACTA CAACTGGGTC
GTCGGCCTGG TGGTCGGCAT GGCCGGCTAC CTGGTGCTGG CGCCACGGAA GGAGCAGGCG
TGA
 
Protein sequence
MALDPSAQIA SDQEGRVDLV DRAAIADSPY YNPELAPVPL EGRTWTTYNF FALWMGMAHN 
IPSYTLAASL VALGMDWVQA LLTITLGNLI VLAPMLLNSH AGTKYGIPFP VFARAFYGVR
GANLVALLRA FVACAWFGIQ TWVGGKALHV IVGRLAGEAW TGAPVVLGQV WTLWLCFLVF
WAVQMLVIWR GMEAIRRFEN WTAPLVSVGF LVLLAYVAVK AGGFGPILSE PSKLGWGPDF
WKVFFPALMG MIAFWSTLSL NMPDFTRFGG SQRKQALGQV LGLPTTMTFI AVVAILTTSG
AQALYGEAIW DPAELASRFD STAVVLVALV SLVLATVSAN LAANVVSPSY DFSNAFPRRI
SFAVGGLITG VLGVLIQPWR LISDPGIYIF AWLGFYGGLL ASVAGVLVAG YWVLARTRLE
LPDLYLSGRG AYWFTGGWNW RAVVATALGS LLAVGGAHGG PFPADGLIPP LKPLYDYNWV
VGLVVGMAGY LVLAPRKEQA