Gene Arth_3413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3413 
Symbol 
ID4444143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3839189 
End bp3840898 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content63% 
IMG OID639691237 
Productpermease for cytosine/purines, uracil, thiamine, allantoin 
Protein accessionYP_832888 
Protein GI116671955 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCGCG ACGCCCTTGA CCATAAGGTG ACCCAGGTCA TAAAGTTATT CCACTCCTCA 
AAACCAAATT TCCATCTTGT GGAATTACTC TCTCGGAAGC AATGGCTTCC CCCATCACCG
TCCCTAGAAC TGGAGAACCA GATGCAGACG ACCCCATCAG TCGGCGTCGA AACGGATTCC
GGCCCGACAG AGTTGTCGGA CCAGCCGGTC CATCCCGCAA CGGGCGTCGA AGCCCTGTGT
GAAACAGCGA GCGCCGCGTC CGGGCGCACC ATCAGCCCCA GCCTGTACAA CATCGACCTT
GCTCCGACGA AGCGGGAAGG CCGCCGCTGG ACGAGCTACA GTATCTTCAC CCTGTGGGCC
AACGACGTCC ACAGCCTTGG AAACTATGCC TTCGCCATCG GGCTTTTCGC ACTGGGCCTC
GGCGGCTGGC AGATCCTGCT GGCCCTCGGC GTCGGTGCCG TCCTCCTGTT CGGGCTCCTA
AGCTTCTCGG GGTTCATGGG CGTCAAAACC GGAGTGCCGT TCCCCGTCAT GAGCCGGATC
AGCTTCGGCA TCAGGGGAGC CCAGATCGCC AGCCTCCTGC GCGGTGCCGT GGCCGTGGCC
TGGTTCGGCA TCCAGACCTA CCTGGCGTCG GTGGTGCTTC GCGTCATGCT CGTGGCCATG
GTTCCTTCGC TGAAGGAGCT GGACTCCAAC TCGATCCTGG GGCTGTCCAC CCTGGGCTGG
GCCGCCTTTG TGTTCCTCTG GATCGTGCAG CTGATCATCG TCAGCTTCGG CATGGAAATG
ATCCGCAAGT ACGAGGCTTT TGCCGGGCCC ATCATCCTGG TGACCATGGC GGCCATCGCC
GTCTGGATCT TCATCGAAGC CGGCGGCTCC ATTGCCTGGT CTTCGGACAA CGCCCTCGAA
GGGGCGGACA TGTGGCGCAC CATCTTCGCC GGCGGCGCCC TGTGGGTGTC CATCTACGGA
ACTTTCGTCC TGAACTTCTG CGACTTCACC CGGTCCGCCG TTTCCAAGAA GGCAGTGGTC
CGCGGCAACT TCTGGGGCAT CCCCATCAAC ATGCTCCTCT TCGGCGCCAT TGTGGTGGTC
ATGGCCGGCG GCCAGTTCAA GATCAACGGC ACGGTCATCC AGAGCCCGTC GGACATTGTC
CAGACCATAC CGAACACCTT GTTCCTGGTC CTGGCGTGCC TCGCGCTGCT CATCCTGACC
ATCGCCGTGA ACCTGATGGC AAACTTCGTG GCCCCGGTCT ACGCCCTGAC CAACCTCTTC
CCGAGGCACC TGAACTTCCG CAAGGCGGCC TGGGTCAGCG GCACGATCGG ACTGATCATC
CTGCCGTGGA ACCTGTACAA CAATCCCCTC GTGATTGTGT ACTTCCTCGG CGGCCTCGGA
GCGTTGCTCG GCCCGCTGTT CGGTGTCGTG ATGGCCGACT ACTGGCTGCT GCGCCGCGGC
CGGGTCAATG TCCCCGAGCT TTACACGGCT GACCCCGCCG GAGCCTACTA CTACAAGAAG
GGCGTGAACC CCCGGGCGAT CATCGCGCTC GTCCCGGCGG CCGTCGTCGC GCTCCTGATA
GCCTTCGTTC CGGCGCTCGA GGCAGCCGCC CCCTTCGCCT GGTTCTTCGC GGCCGGCATC
GCCGCAGTGG TGTACTACTT CATCGCCGAC CGCTCGCAGC GGCTCGAAGA TGTCGACGGC
GAATCCATCG CCGTCGCGAG CACGCACTGA
 
Protein sequence
MRRDALDHKV TQVIKLFHSS KPNFHLVELL SRKQWLPPSP SLELENQMQT TPSVGVETDS 
GPTELSDQPV HPATGVEALC ETASAASGRT ISPSLYNIDL APTKREGRRW TSYSIFTLWA
NDVHSLGNYA FAIGLFALGL GGWQILLALG VGAVLLFGLL SFSGFMGVKT GVPFPVMSRI
SFGIRGAQIA SLLRGAVAVA WFGIQTYLAS VVLRVMLVAM VPSLKELDSN SILGLSTLGW
AAFVFLWIVQ LIIVSFGMEM IRKYEAFAGP IILVTMAAIA VWIFIEAGGS IAWSSDNALE
GADMWRTIFA GGALWVSIYG TFVLNFCDFT RSAVSKKAVV RGNFWGIPIN MLLFGAIVVV
MAGGQFKING TVIQSPSDIV QTIPNTLFLV LACLALLILT IAVNLMANFV APVYALTNLF
PRHLNFRKAA WVSGTIGLII LPWNLYNNPL VIVYFLGGLG ALLGPLFGVV MADYWLLRRG
RVNVPELYTA DPAGAYYYKK GVNPRAIIAL VPAAVVALLI AFVPALEAAA PFAWFFAAGI
AAVVYYFIAD RSQRLEDVDG ESIAVASTH