Gene Arth_3697 details

Gene Information

Locus tag: Arth_3697
Symbol:
ID: 4443698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4158906
End bp: 4160522
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 64%
IMG OID: 639691521
Product: permease for cytosine/purines, uracil, thiamine, allantoin
Protein accession: YP_833172
Protein GI: 116672239
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID:
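
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(4158906, 4160522)
print(length_bp)                  # 1617
print(protein_length(length_bp))  # 538
```

Both results agree with the Gene Length and Protein Length fields of the record.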


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACATAT CGTATCTTGT TGCGTGTGGT CTGCGGCACA TACTCTGGCT GTATCCCCTT 
TCTCCCACGG AGGCCCACAT GCAAAACAAC TCCACTGCAG TCCCGGCCGG CCACGTGCCC
GCAGCCGGCG AAGACGTTGA AGCCTGGCTT CAGCCCATCC CCGAATCACA GCGGACCCGG
AAGGTTTCCG GACAGTTCTG GATCTGGGCG GGAGCCAACC TGGCGCCCAT CAACTGGGTG
CTCGGCGCCC TGGGCATCCA CCTCGGACTT GGCTTCGCCG ACACCGTTAC CGTCCTGGTT
TTGGGGAACC TGATCGGCAT GCTTCTGTTC GGCTGCTTCG TACTGCTGGG CCAGAAGACA
GGTGCCACGG GCATGGTGCT GGCCCGCGCG GCATTTGGCC GGCGCGGAAA CTACCTTCCC
GCCGCCATCC AGGCGCTACT GGTCATCGGC TGGTGCGCCG TCAACACCTG GATCATCCTG
GACCTTGTCA TGGCACTCTT CGGCACCCTC GGCTGGGTGG ACCCGACAGC CCACAACTAC
GCCTGGAAGA TCGGTGTCGC CACCACCATC ATGGCTGCGC AGGTTGCCAT CGCCTGGTTC
GGCTACAAGG CTATTGCGGC TTTTGAAAAG TGGACCGTGC CGCCCACCAT CATCATCCTC
GCGGTGATGT CCGCCGTCGC CTGGTTCGGC ATGAAGATCA ACTGGTCCTA CGCCGGACCC
GCCGGCAACA TCCTCGAAGG CTCGGAGCGG ATCGCGGCCA TGAGCGCCGT TATGACGGCC
ATCGGCATCG GCTGGGGAAT CACCTGGTTC ACCTACGCCG CTGACTACTC CCGTTTTGTG
AGCACCGAGG TGCCGAAGCG CAAGGTCTAC CTGGCCTCAG TGCTGGGGCA GTTCATCCCG
GTCGTCTGGC TCGGAGTCCT GGGCGCCAGC CTTGCAACCA ACAGTGGCGA GATCGACCCC
GGAAAACTCA TCGTGCAAAA CTTCGGAGTC CTCGCGCTTC CGGTACTCCT GATGGTCCTG
CACGGACCCA TCGCCACCAA CATCCTGAAC ATCTACACCT TCTCAGTCGC AACCCAGGCC
CTCGACATCA CCATCAGCCG CCGCAAGCTC AATCTGTTCG TCGGCGTTTT CTCGCTCATC
GCCGTCGTAT TCTTCATCTT CCAGGAGGAC TTCGCCTCCG TCCTGGATGC GTGGCTGATC
GGTCTCGTGG CCTGGGTGGC CGCCTGGGGC GGCGTCATGC TGGTGCATTT CTTCTGGATC
GAGAAGCGCT GGCCCGGCGA GGCCTCACGG CTGTTCGACG GCGTGGGCAC TAAGCGGCTC
CCCGGAGTCA ACTGGGCGGG CGTCGTGTCC CTCCTGGTCG GCATTTTCGC CACCTGGCTG
TTCATGTACG GCCTGGTTCC CGCAATGCAG GGCCCCATCG CAGTGGCACT GGGCGGCTGG
GACCTCTCCT GGCTCGCCGG CGGCGTCAGC AGCGCAGCGT GCTACGCAGT TCTTGGCCCC
CGGGTCCACC GGAAATTTCT GGCCGGCGGC GCCGCTGAGC CGGTCCAGGT AACCGTCCCG
GAGACGACGG TCCCGGAACC TTCAGCCCGC CCGTCAACCA CCGCAGTCTC GCTGTAG
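
The gene sequence is displayed in blocks of ten bases separated by spaces, so it has to be joined before any length or composition check. The following is a minimal sketch of how the GC content field could be recomputed from the text above. The helper names are hypothetical, and the rounding the database applies to its percentage is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks of the 10-base display blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with the first two display blocks of the gene sequence above;
# pass the full sequence text to compare against the GC content field.
fragment = clean_sequence("ATGTACATAT CGTATCTTGT")
print(len(fragment), gc_fraction(fragment))
```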
 
Protein sequence
MYISYLVACG LRHILWLYPL SPTEAHMQNN STAVPAGHVP AAGEDVEAWL QPIPESQRTR 
KVSGQFWIWA GANLAPINWV LGALGIHLGL GFADTVTVLV LGNLIGMLLF GCFVLLGQKT
GATGMVLARA AFGRRGNYLP AAIQALLVIG WCAVNTWIIL DLVMALFGTL GWVDPTAHNY
AWKIGVATTI MAAQVAIAWF GYKAIAAFEK WTVPPTIIIL AVMSAVAWFG MKINWSYAGP
AGNILEGSER IAAMSAVMTA IGIGWGITWF TYAADYSRFV STEVPKRKVY LASVLGQFIP
VVWLGVLGAS LATNSGEIDP GKLIVQNFGV LALPVLLMVL HGPIATNILN IYTFSVATQA
LDITISRRKL NLFVGVFSLI AVVFFIFQED FASVLDAWLI GLVAWVAAWG GVMLVHFFWI
EKRWPGEASR LFDGVGTKRL PGVNWAGVVS LLVGIFATWL FMYGLVPAMQ GPIAVALGGW
DLSWLAGGVS SAACYAVLGP RVHRKFLAGG AAEPVQVTVP ETTVPEPSAR PSTTAVSL
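
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI-style amino-acid string, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its set of permitted start codons. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three display blocks of the gene sequence above.
print(translate("ATGTACATAT CGTATCTTGT TGCGTGTGGT"))  # MYISYLVACG
```

The output matches the first ten residues of the protein sequence listed above.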