Gene Arth_2575 details

Gene Information

Locus tag: Arth_2575
Symbol:
ID: 4444834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2891786
End bp: 2892742
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 68%
IMG OID: 639690394
Product: major intrinsic protein
Protein accession: YP_832054
Protein GI: 116671121
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0580] Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family)
TIGRFAM ID: [TIGR00861] MIP family channel proteins
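
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both are assumptions rather than something the record states, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2891786
END_BP = 2892742
GENE_LENGTH_BP = 957
PROTEIN_LENGTH_AA = 318


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```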


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.770584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTCCC CCGTGCCTGC ACGCCAGACC ACAAGTCCCG ATTTGCGCTC CGCTCCGGAG 
GCTTTCACAC CCGGGCTTGC GGCACGTCTG TCGGCGGAGG CCTTTGGAAC CCTTTTCCTG
GTGATCGCCG GACTGGGCGT GCCGCTGTTC ACGATTCCGC AGTCCAACCC GTTGTCCGCC
TCGCTGGCCG CCGGGCTTGC CGTCACGGCC GCTATGCTCG CCTTCGCTTA TATCTCCGGC
GGCCACTTCA ACCCGGCAGT GACCCTGGGC AACGCCATCG CGGGCCGCAT CAGGCTGCCG
GAAGCTGCGG CGTACGTCGG CGCGCAGCTT GTCGGCGCCG CCGCCGGCAC CCTGGCACTG
TTCGGAATCC TGCGAACGGT GCCCAAAATC GAAGACACCC GCGCCGCCTT TGACACCGTG
ACGGCGGGCT TTGGCGAGCA TTCCATCATC CAGGCTCCCA TGGCCGGAGT CCTGCTCGTC
GAGGTACTCG GCGCTGCGCT CCTGGTAGCC GTCTTCCTGG GGACCACAGC TGCCCGCAAC
ACCAACAAAG CCGCAGCCCC CTTCGCGGTG GGACTCACGC TCGCCGTCCT GCTGCAGCTG
GGGCAGGCCG TGGGCAACAC ACCGTTCAAT CCGGCGCGCG CCACCGCGTC GGCCATCTTC
AGCAACACCT GGTCCCTCGA GCAGCTGTGG CTCTTCTGGG TGGCCCCGCT GGTGGGTGCC
GCCATCGCGG GCCTTGTGTT CCGCGGGTTC GCCGACACCC CTGCGGCTGC TCCGTCCCCC
GCCCAGGCGG ACGCCGACGA CGCTGCGCAC GAGTCCGATG ACGATCTGCA CGACGATGTT
GACGATGACA CCACCGGTTT TGAAGGTGAC GCCGCCGGCC GCTCCGACGC GCCGGCTGCC
GGCGCCTCTG CCAACGATGA CGTCCGGGAC TTCTTTGACG GCAAGCGCGG ACAGTAG
 
Protein sequence
MTSPVPARQT TSPDLRSAPE AFTPGLAARL SAEAFGTLFL VIAGLGVPLF TIPQSNPLSA 
SLAAGLAVTA AMLAFAYISG GHFNPAVTLG NAIAGRIRLP EAAAYVGAQL VGAAAGTLAL
FGILRTVPKI EDTRAAFDTV TAGFGEHSII QAPMAGVLLV EVLGAALLVA VFLGTTAARN
TNKAAAPFAV GLTLAVLLQL GQAVGNTPFN PARATASAIF SNTWSLEQLW LFWVAPLVGA
AIAGLVFRGF ADTPAAAPSP AQADADDAAH ESDDDLHDDV DDDTTGFEGD AAGRSDAPAA
GASANDDVRD FFDGKRGQ
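
The gene and protein sequences above can be related to each other with a short translation routine. The following is a minimal sketch in plain Python, not the method the database itself uses. Translation table 11 assigns the same amino acid to every codon as the standard code and differs only in which codons may start translation, so the sketch uses the standard assignments. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, dropping one trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACCTCCC CCGTGCCTGC ACGCCAGACC"))  # MTSPVPARQT
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the reported GC content.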