Gene Arth_4102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4102 
Symbol 
ID4447692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4623211 
End bp4624284 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content68% 
IMG OID639691933 
Producthypothetical protein 
Protein accessionYP_833577 
Protein GI116672644 
COG category[R] General function prediction only 
COG ID[COG5006] Predicted permease, DMT superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCTGCCG CAAGCAAACG CGCCGATGCG TCCTTCCAGG GCGCCGCGCC CAGGGTCGGC 
GCCCCCCGCC GGCCGGCTGC CGGGTTCCTG GCATCCGGGC TCGGCGTGGC GCTATTCTCT
TCAGCCGTCT TCGGCCTTTC CGGATCTTTT GCCAAGGCCC TGCTCGAAAC CGGGTGGACT
CCGGGTGCGG CCGTGACCGC GCGCCTGACC GGGGCAGCCC TCATCCTCGC GATCCCGGCA
GTGCCCGCAC TGCACGGCCG CTGGCGCCAG CTGAGGGACA ACTGGCTGAC CATCCTGCTG
TTCGGGCTCA TCGGGGTCGC AGCCTGCCAG CTGTTCTACT TCAACGCCGT CGAGCGGCTC
TCCGTGGGCG TTGCCCTGCT GCTGGAGTAC CTGGCCCCGG TGATCATCGT CCTCTGGCTG
TGGGCCGCGA GCCGACGGCG TCCGCGCCCG CTCACCATTG CGGGAACGCT GCTTTCGCTG
GGCGGACTCA TCCTGGTGCT GGACCTTACC GGTGCCGTGA AGATCGACGT CGTCGGCGTC
CTGTGGGGAG TCGCCGCAGC CGTCTGCCTG GCGATCTATT TCTTCATCAC TGCAAAGGAA
AATGACACCC TCCCGCCGAT CGTCCTCGCA TCCGGCGGCC TGATGGTGGG CGCCGTGGTG
ATGTGGCTGG CGGCTGCCAC CGGACTTCTG CCGATGGCGT TCAGCACGGC GGACACCAAA
CTGGGGCCGT GGGTCACACC GTGGTGGGTT TCGCTGGGCG GCCTAATCAT CCTTGCCACG
GTCCTCGCGT ACGTCTCGGG CATCGTTGCC GCGCGGGCGC TCGGTTCAAA GGTTGCATCA
TTCGTGTCGC TCACCGAGGT GCTTTTCGCC GTCATCTGGG CGTGGCTCCT GCTCGGTGAA
CTGCCCGGTC CTATCCAGCT CCTCGGCGGT GTGCTGATTG TTGGCGGCGT CGTTCTGGTC
CGCGTGGACG AGCTCCGCGG GCCGCGGGTA GCGCCGGCCT CTTCAGGCGG ACCCGGTGCA
GCGCCGGTGC CGGCGCCGCT GGACCACGCG AACGACGTCG AACCCGTCCC CTAA
 
Protein sequence
MPAASKRADA SFQGAAPRVG APRRPAAGFL ASGLGVALFS SAVFGLSGSF AKALLETGWT 
PGAAVTARLT GAALILAIPA VPALHGRWRQ LRDNWLTILL FGLIGVAACQ LFYFNAVERL
SVGVALLLEY LAPVIIVLWL WAASRRRPRP LTIAGTLLSL GGLILVLDLT GAVKIDVVGV
LWGVAAAVCL AIYFFITAKE NDTLPPIVLA SGGLMVGAVV MWLAAATGLL PMAFSTADTK
LGPWVTPWWV SLGGLIILAT VLAYVSGIVA ARALGSKVAS FVSLTEVLFA VIWAWLLLGE
LPGPIQLLGG VLIVGGVVLV RVDELRGPRV APASSGGPGA APVPAPLDHA NDVEPVP