Gene Arth_4017 details


Gene Information

Locus tag: Arth_4017
Symbol:
ID: 4447818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4536024
End bp: 4537010
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 64%
IMG OID: 639691848
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_833492
Protein GI: 116672559
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
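
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(4536024, 4537010)
print(length)                              # 987
print(expected_protein_length_aa(length))  # 328
```

Under those assumptions the results agree with the reported Gene Length (987 bp) and Protein Length (328 aa).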


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTCCGTT ACCTCGCCAA AAGGGCAGTT ACCTACCTGC TCATGATTTT CCTGACCACC 
ACGGCGGGGT ACTTCCTCGC CGTCAGCACG CTTCAGCCGG CGCTGCTGGA GCAGGAACGC
ATTCCGCGGC CGACGCCCGA GCAGGTGACA AACTCCTTCC GCCTCAAGGG CCTTGATCCG
ACGCTGAGCC CGTGGGAGCG CTACGTTGAC TGGCTGACGG CAGTGGTCAC GCGCTGGGAC
TGGGGCCGAA GCCCCAACGG CGCTTTCATC AATGCCGAGT TCGGCGACCG CGTCTGGATC
TCTACCCGGC TGTTCCTGGC CTCCATCGTC CTCACCCTGA TCATCGGCGT GGCACTGGGC
GTGTACACCG CAGCCCGGCA GTACAAGTTT TCCGACCGCG CGATCACCTC TTACAGCTAT
CTCGTATACA TCGTGCCCGC GCCCATCGCG TACTTCCTGG TCCAGCTGGG TGCCATCAAC
ATCAACGAAA CGGTCGGCGA ACGGATTCTA TTCGTCACCG GCATCTCCAC CCCCGGGCTG
GAAGGCAACG GCTGGGTCCA GTTCCTGGAC ATGCTGGCCC ACTACGCCGT GCCCACGTTT
GCCATCACCA TCGTGGGCTG GGGGACCTAC CAGATCGCCC AGCGCCAGTA CCTGCTGGAC
AACGTCAACG CCGACTTTGT CCGGACGGCC CGGGCCAAGG GCCTCACCCG CAACCAGGCC
ATCACCCGCC ACGCCCTTCG GGTCTCGTTC ATCCCGGTGG CGCAAAGCAT TGCGTTCACC
ATTCCGGCCA TCTTCGCCGG CGGATTCTTT GCCGAGAAGA TCTTCGCCTG GCACGGCGTC
GGCTCCTGGA GCATCGACGC GATCGCCTTG CAGGACGTGA ACGCGGCCAC GGCCACTTTG
GCTTACGGCT CGGTGATTTT CGCGATCGGA GCCATCCTCG CGGACTTTGC CACCACCCTT
GTCGACCCGA GAGTGCGGGT GCAGTAG
 
Protein sequence
MFRYLAKRAV TYLLMIFLTT TAGYFLAVST LQPALLEQER IPRPTPEQVT NSFRLKGLDP 
TLSPWERYVD WLTAVVTRWD WGRSPNGAFI NAEFGDRVWI STRLFLASIV LTLIIGVALG
VYTAARQYKF SDRAITSYSY LVYIVPAPIA YFLVQLGAIN INETVGERIL FVTGISTPGL
EGNGWVQFLD MLAHYAVPTF AITIVGWGTY QIAQRQYLLD NVNADFVRTA RAKGLTRNQA
ITRHALRVSF IPVAQSIAFT IPAIFAGGFF AEKIFAWHGV GSWSIDAIAL QDVNAATATL
AYGSVIFAIG AILADFATTL VDPRVRVQ
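
The sequences above are printed in space-separated groups of ten letters, so they need to be joined before any analysis. The following is a minimal sketch with hypothetical helper names (not part of any database tool). It joins the groups, splits a coding sequence into codons, and computes a GC fraction. The two sample strings are copied from the first and last rows of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in groups."""
    return "".join(text.split()).upper()


def split_codons(dna: str) -> list[str]:
    """Split a coding sequence into consecutive 3-base codons."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Last row of the gene sequence: the final codon is the stop codon.
last_row = clean_sequence("GTCGACCCGA GAGTGCGGGT GCAGTAG")
print(split_codons(last_row)[-1])  # TAG

# First group of the gene sequence: the first codon is the start codon.
first_group = clean_sequence("GTGTTCCGTT")
print(first_group[:3])  # GTG
```

GTG is one of the alternative start codons in translation table 11 and is read as methionine at initiation, which matches the leading M of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.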