Gene Arth_3724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3724 
Symbol 
ID4443725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4193262 
End bp4194950 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content66% 
IMG OID639691548 
Productamino acid permease-associated region 
Protein accessionYP_833199 
Protein GI116672266 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID[TIGR03428] permease, urea carboxylase system 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGAAC CCACCACATC GGCACCCAGC AAAAGCGCCG ACTCAAGCGG CATGGACGAA 
TTCGGCTACG CCCAGACCCT GGACCGCAGC ATCGGCAAGT TCGCCAGCTT CGCCGCGGGA
GTCAGCTACA TCTCCATCCT CACCGGCGTC TTTCAACTGT TCTACTTCGG TTTCTCCATG
GCCGGGCCGG CCTACGCCTG GTCCTGGCCC CTCGTCTTCG CCGGCCAACT GATGGTGGCA
CTGTGCTTCG CCGAACTGGC AGGCCGCTAC CCGGTGGCCG GCTCGGTCTA CAACTGGGCC
AAGCGGCTTT CCTCCGGGAC GTGGGCATGG CTGGCCGGCT GGCTGCTCCT GCTCTCCTCC
ATGATGGCGC TTGGGGCGGT CGCCCTGGCC CTGCAGCTCA CGCTGCCGCA GATCTGGTCC
GGCTTCCAGT TCATCGGGGA CGGGACCGGT CCGTACGACT TCGCCGTCAA CGGCGTGATC
CTGGCCAGCA TCATGATCGG CATCTCCACC CTCATCAACG CATTCGGCGT GAAGCTGATG
ACGCGGATCA ACAGCATCGG CGTCTTTGTG GAACTGGCCG CGGCCGTGCT GCTCATCCTG
GCGCTGGGCT GGCACGTAGT GCGCGGTCCC GAAGTCCTCT TCGAGACCGC CGGTTACGGC
GACGACCACC CGCTGGGATT CTTCGGGGTG TTCCTCATCG GCGCCATGGC CTCCGGCTAC
GTCATGTACG GCTTCGACAC TGCGAGTTCC CTTGGCGAGG AGACCAAGGA CCCGAAGCGC
ACCGCCCCCA AAGCAATCCT CCGCGCCGTC ACCGCATCGT TCGTGCTCGG TGGCCTGATC
CTCCTCGGCG GGCTGCTGGC TGCGCCGGAC CTGAACGACC CGAAGGTGGG CGCTGCGGAC
GGCGGACTGC AGTACGTGGT CCTGTCCGTG CTGGGCGGGC CTTTCGGCAA GGCATTCCTG
GTCTGCATCG TGGTGGCTGT CGTGGTCTGC ACCCTGGCCG TCCACGCCGC CGCCATCCGG
ATGATGTTCG CCATGGCGCG GGACAACAAC CTGCCCTTTA GCCGCCAGCT CAGCAAAGTG
GATCCGGCCC GCAAGACGCC CACTGTTGCC GCCATCGTCA TCGGCATCCT GGCCGTCGTC
CCGCTGATCG TCAACATCAC GCAGCCGGCA ATTTTCACCA TCATGTCCAG CATCAGCATC
GTCCTGATCT ACCTGTCCTA CCTGCTGGTC ACGGTGCCGA TGCTGCGGAA GCGGCTGCAG
AAGAAATGGC CGCTGGCTGA AGACGACACC GAACCCGGCT TCAGCCTGGG CAAATGGGGG
ATGCCGGTAA ATATCCTCGC CGTACTGTGG GGCGGTGCCA TGACCCTCAA CCTGATCTGG
CCGCGCCCGG AGATCTACAA CTCCGTGCCG CCGTTCGAGT GGTACCTCCA GTGGGGCGGC
GTCATCTTCG TCGGTGCAGT CGCGATTGGC GGGGCGTTGC TCTACCGTCT GAGGATCAGG
CACCGCACGG GCGTCCTGGC GGAGCACGCC GCAGTGCCCG CCGCAGTCCG GGCAGTAGCG
CACGCCGCCG TCCCGGGCCC GGGCCCGGAT CAGGTCCGGG TTTCGGGCGC CGGCCAAGGC
CCCGGCCAGG GCGCCGATCA GGGCCTTGAG CCGCTAGAAG AAGATTTGGA ACCGCAACGA
GTCAGCTAG
 
Protein sequence
MLEPTTSAPS KSADSSGMDE FGYAQTLDRS IGKFASFAAG VSYISILTGV FQLFYFGFSM 
AGPAYAWSWP LVFAGQLMVA LCFAELAGRY PVAGSVYNWA KRLSSGTWAW LAGWLLLLSS
MMALGAVALA LQLTLPQIWS GFQFIGDGTG PYDFAVNGVI LASIMIGIST LINAFGVKLM
TRINSIGVFV ELAAAVLLIL ALGWHVVRGP EVLFETAGYG DDHPLGFFGV FLIGAMASGY
VMYGFDTASS LGEETKDPKR TAPKAILRAV TASFVLGGLI LLGGLLAAPD LNDPKVGAAD
GGLQYVVLSV LGGPFGKAFL VCIVVAVVVC TLAVHAAAIR MMFAMARDNN LPFSRQLSKV
DPARKTPTVA AIVIGILAVV PLIVNITQPA IFTIMSSISI VLIYLSYLLV TVPMLRKRLQ
KKWPLAEDDT EPGFSLGKWG MPVNILAVLW GGAMTLNLIW PRPEIYNSVP PFEWYLQWGG
VIFVGAVAIG GALLYRLRIR HRTGVLAEHA AVPAAVRAVA HAAVPGPGPD QVRVSGAGQG
PGQGADQGLE PLEEDLEPQR VS