Gene Arth_3666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3666 
Symbol 
ID4443667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4120770 
End bp4122281 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content68% 
IMG OID639691490 
Productamino acid permease-associated region 
Protein accessionYP_833141 
Protein GI116672208 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID[TIGR03428] permease, urea carboxylase system 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTCCA CCATTTCGAC GGCGTCCGCC CACGCAGACG ATGCAGACCT GACCTCGCTC 
GGCTACCAGC CGTCCTTGCA CCGCAAACTT GGCCGCTACG CTTCCTTCGC CGCAGGATTC
TCGTTCGTTT CCATCCTGAC CACGATCTTC CAGCTCTTTG CCTTCGGGTA CTCCTTTGCC
GGACCCGCGT TCTTCTGGAC CTGGCCGGTG GTGCTGGTCG GCCAGCTGCT CGTTGCCCTG
AACTTCGCCG AACTCGCCGC CCGCTACCCT CTGTCCGGCG CCGTCTACCA GTGGTCCCGC
CGCATGGGCG GCGAAGTGGT GGGCTGGTTC GCGGGCTGGT TCATGGCGAT CGCCCAGGTT
GTCACTGCGG CCGCGGCCGC CATCGCACTC CAGGTGGTCC TCCCGCAGCT GTGGGACGGC
TTCCAGATTG TCGGCGGCGA TCCCGCATTG GCCACCGTCA CCGGCGCCGC CAACGCCGTA
GTCCTCGGGG CCGCACTCCT CGTGGTCACC ACGGTGATCA ACTGCCTGGG CGTCAAGCTC
ATGTCGCACG TCAACTCCAT CGGCGTGACC TGCGAAATCG TCGGCGTCGC CGCCGTCATC
CTCGCCCTGA TTTCCGCCGC CCAGCGCGGA CCCGAGGTAG TGGCGGACGT CAGCGTGGTG
GCCGGCTCGG ACCTCGGGGC CGTGGGCGCG TTCCTCGTCT CCGGCCTCAT GGCCGCCTAC
GTCATGGTCG GTTTCAACTC CGCCGGGGAG CTCTCGGAAG AGACGAAAGA CCCGCGCCGG
ACCGCACCCC GCACCATCCT TTCGGCCCTG GTCATTTCCG GTATCGGCGG CGGACTCATG
ATCATCACCG CACTCATGGC CGCCCCCAGC CTCGACGACG GACGCCTCGC CGCAGAGGGC
CTGCCCTATG TGCTCACCGC CGTCCTGGGA ACCTTCTGGG GCAAGGTCCT CCTGGTGGAT
GTCGCCGTCG CGATCTTCGT CTGCACCCTG GCCATCCAGA CGGCGGGATC CCGCCTGGTG
TTCTCGATGG CCCGCGACGG CAAACTGCCC GCCTCCGCGC TTCTGTCCTC GGTCCACCCG
GAGCGCGGCA CGCCGATGTG GCCGTCCATC GCCATCGGCG GCCTCGCCGT CGGCGTCCTC
GCCATCAACA TCGGCAACGC CGCCCTGTTC ACCACGCTCT GCAGCGTCTG CATCGTCATG
GTGTACCTGG CATACCTCCT GGTCACTGTT CCTCAGCTGC TCAGCCGCCT GCGCGGAGAC
TGGGACCGGG TGGGCCAGAC CATGCCGGCG GGACTGTTCT CGCTGGGACG CTGGGGGCTC
CCCGTCAACA TCCTGGCCGT TCTCTACGGC GCCGTGATGG TGGTAAATCT CGCCTGGCCG
CGTCCCGAGG TCTACGACCC CAGCGGCGAA AACGGACTCC TGCTGTTCTC CGCACCCCTC
ATGGTCGGGG CCGTGCTGCT CCTGGGTATC TGGGTGCGCA GCCGCAAGCC CGCCGACATG
CCCGCAGCCT AG
 
Protein sequence
MTSTISTASA HADDADLTSL GYQPSLHRKL GRYASFAAGF SFVSILTTIF QLFAFGYSFA 
GPAFFWTWPV VLVGQLLVAL NFAELAARYP LSGAVYQWSR RMGGEVVGWF AGWFMAIAQV
VTAAAAAIAL QVVLPQLWDG FQIVGGDPAL ATVTGAANAV VLGAALLVVT TVINCLGVKL
MSHVNSIGVT CEIVGVAAVI LALISAAQRG PEVVADVSVV AGSDLGAVGA FLVSGLMAAY
VMVGFNSAGE LSEETKDPRR TAPRTILSAL VISGIGGGLM IITALMAAPS LDDGRLAAEG
LPYVLTAVLG TFWGKVLLVD VAVAIFVCTL AIQTAGSRLV FSMARDGKLP ASALLSSVHP
ERGTPMWPSI AIGGLAVGVL AINIGNAALF TTLCSVCIVM VYLAYLLVTV PQLLSRLRGD
WDRVGQTMPA GLFSLGRWGL PVNILAVLYG AVMVVNLAWP RPEVYDPSGE NGLLLFSAPL
MVGAVLLLGI WVRSRKPADM PAA