Gene Arth_3054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3054 
Symbol 
ID4444287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3424529 
End bp3426049 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content66% 
IMG OID639690880 
Productamino acid permease-associated region 
Protein accessionYP_832533 
Protein GI116671600 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.479156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAG GGACGACGAC GGCGGGAGGA CCCGGGGGAA GCTCCGGCGG CCCGGGACCG 
GAGGTCCCGT CCAAGGGGCT GCGCGCCGGC ATCCTGGACC TGGGCGACTC GGTCATGCTG
GGGCTCGCGT CCACCGCCCC CGTCTATTCA CTCGCCGCCA CGCTGGGCCT GATCGTGGCG
GTCAATGGCA ACTACACCCC GCTGATCCTT CTCCTGGGCT TTGTGCCGGT GCTCTTCATC
GCCTACGCCT TCCGGGAACT GAACACCGCC ATGCCGGACT GCGGCACCAC CTTCATCTGG
GCACGCCGGG CATTCGGGCC ATGGGCGGGC TGGCTGGGCG GCTGGGGCGT CGCCCTGGCC
GGCATCGTGG TGCTCGCCAA CCTGGCCCAA GTGGCAGGGC AATACCTGTG GCTGCTGGTT
GGCGACGGTT CGCTGGCCGA AAACGACCTT TTGGTGACGG CCACCGGCGT CGTGTTCATC
ATTTTCATGA CCCTGGTGAA TTACCGCGGC ATCCGGCTCG GCGAGCACGT CCAGCGGGTG
CTGACCTACG TGCAGTATGT CTCGCTGGGC ATATTCGCGC TGGCCATCGT GTTCCGGATT
ACGGGCGGAG CACCGGAGGG CCAGGCGTTC GACTTCGAAT GGTTCAACCC CGCGGGTGCC
TTCGCCGATC CCGGCGCAGT GGTTCACGGT GCCCTATTGG CCCTGTTCAT CTATTGGGGC
TGGGACACTT GCCTGGCCGT GAACGAGGAA ACCGAGAACC CGTCCAAGAC GCCCGGCCGC
GGGGCCGTCA TCTCCGCCTT CGTCCTGATG GCAATCTACG TTTCCGTGGC GCTCTTGGTG
ATGATGTATG CCACCGTGGG CACGGACGGC ATCGGCCTCG GCAATGCGGA AAATCAGGAT
GACGTCTTCC TGGCCATGAA GGATGTTGTG CTCGGCCCGT GGGGCTGGCT GATCGTGGTG
GCCGTGCTGG CCTCGGTGCT GTCCTCCACC CAGACCACCA TCCTGCCCAC CGCCCGCGGG
ACGCTGTCCA TGGGCGTCCA CGGCGCCCTG CCGGCCAAGT TCGGCGAAGT CCACCCGCGA
AACCAGACTC CGGGGTTCTC CACGCAGGTG ATGGGGGCTG CCGCCGTCGT GTACTACGTG
GCCATGAGCT TCCTGAGCCA GAACCTGCTT TCGGATTCCA TCAGCGCCAT CAGCCTGTTC
ATTGCTTTCT ACTATGCGCT GACCGGCTTC GCGTGCGTCT GGTTCTTCCG GGGCACCTTG
CGCGTTTCCG CCCGCAATCT CTGGTTCCGG GGCATCCTGC CGCTGCTCGG GGCACTGATG
CTGACCGCGG CGTTCTTCAT CTCCGCCGTT CAGATGTGGG ACCCCGCCTA CGGCGATACG
GAGATCTTCG GCGTGGGTGG CGCCTTCGTC AGCGGTGTGG TGCTGCTGGC GCTGGGCGTG
GTGCTGGCCG TGGTGTGCCG CTTCTCCCCC GCCACCCGGG ACTACTTCAT GGCACGGCAA
GCGCAGCCCA GCGAACTGTA G
 
Protein sequence
MSTGTTTAGG PGGSSGGPGP EVPSKGLRAG ILDLGDSVML GLASTAPVYS LAATLGLIVA 
VNGNYTPLIL LLGFVPVLFI AYAFRELNTA MPDCGTTFIW ARRAFGPWAG WLGGWGVALA
GIVVLANLAQ VAGQYLWLLV GDGSLAENDL LVTATGVVFI IFMTLVNYRG IRLGEHVQRV
LTYVQYVSLG IFALAIVFRI TGGAPEGQAF DFEWFNPAGA FADPGAVVHG ALLALFIYWG
WDTCLAVNEE TENPSKTPGR GAVISAFVLM AIYVSVALLV MMYATVGTDG IGLGNAENQD
DVFLAMKDVV LGPWGWLIVV AVLASVLSST QTTILPTARG TLSMGVHGAL PAKFGEVHPR
NQTPGFSTQV MGAAAVVYYV AMSFLSQNLL SDSISAISLF IAFYYALTGF ACVWFFRGTL
RVSARNLWFR GILPLLGALM LTAAFFISAV QMWDPAYGDT EIFGVGGAFV SGVVLLALGV
VLAVVCRFSP ATRDYFMARQ AQPSEL