Gene Noca_1842 details


Gene Information

Locus tag: Noca_1842
Symbol:
ID: 4597162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1965841
End bp: 1967025
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 65%
IMG OID: 639776441
Product: extracellular ligand-binding receptor
Protein accession: YP_923040
Protein GI: 119716075
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
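
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1965841
END_BP = 1967025
GENE_LENGTH_BP = 1185
PROTEIN_LENGTH_AA = 394


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1185
print(protein_length(GENE_LENGTH_BP))  # 394
```

Under these assumptions, 1967025 - 1965841 + 1 gives the 1185 bp gene length, and 1185 / 3 - 1 gives the 394 aa protein length.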


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.843132
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTCA AGAAGGCGTA CCTCGCGGCT GGCGCCACGC TGGCGTTGGC GCTCTTCGTC 
AGCGCGTGCG GCAGCGACTC GGGCGGGTCC GGCGGCGGTG GCGATGACAC GATCACCGTG
GCGGTCGCGG GTCCCATGAC CGGAGACAAC GGCATCTACG GCCAGGATCA GCTGTCGGGC
GTGCAGTTCG CAGCGAAGGA GATCAACGAC TCGGGCGGGA TCCCTGACGG TCCGTTGAAG
GGCAAGAAGA TCAAGGTCGT CAAGTTCGAC GACGTGGCCG ACCCCAACCA GGGTGCGTCC
GTGGCGCAGA AGATCTGTGA CGACACCAGC ATCATGGCGG TCTTCGGCCA CAGCAACTCC
TCGGTCACGC TCGCCGCGGA GCCGATCTAC GAGCGCTGCG GGGTGCCGCT CTTCGTCAGC
TACTCGTCGA ACCCGGAGAT CACCGCGGAG CTACACGAGA ACCTGTTCCG CACGCTCATC
GACGACGCCA AGATGGGCAG CGAGATGGCG AGTTTTTCCC ACGACCAGCT CGGCTTCAAG
AAGGTCGGCG TCATCGCCTC CGACGACGAC TACGGCGACG GCCTGAAGAC CAACTTCAAC
AAGACGGCCG AGGAGATCGG CCTCGACGTC GCGAAGACCG TCACGACGTC GGCGAAGCAG
AAGGACTTCA CGCCGCAGCT GACCGAGCTC CGCAACGCCG GTGCCGACTC GCTGGTGCTC
CTGAACACCT ACACGGACGC CGCGCTGCAG ATCAAGCAGG CCGACGCGAT GGGCTGGGAC
GTCCCGATCT TCGTCACCCC GGGCTCGAAC AGCCCGGAGC TGGTCAAGAT CGCCGGTGAG
AAGGCGGCGG AGGGCACAAT CGTCGCCGCG GTCTTCGACC CCAACTCGAG CGAGCCGGGC
CCGGCGAAGT TCGTCAACGA CTTCACCGCC GCCAACGGCA AGGGTCCGGG CGAGTCCGCC
GCGATGTCCT ACGACTCCTT CTACGTGTTC CTGACCTCCC TGGAGAAGGG TGCGAAGGAC
CGCAAGAGCG TCATCGAGAA GTCCGCCGAG ATCGGGACGT TCACACTCCC GATCCGCGGC
GAGCTGATGT TCAACGAGAC CCACGAGCCG ACGGTCGTGC CGGGCAAGCC CGCGCAGATC
CTGCTCCAGG TCAAGGACGG CCAGATCGGC AGCTACGCCG GCTGA
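
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any analysis. As a minimal sketch, the hypothetical helpers below strip the block spacing and compute a GC percentage. The example uses only the first row of the listing, so its value is not expected to match the 65% reported for the whole gene.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks that group bases into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGAAGTTCA AGAAGGCGTA CCTCGCGGCT GGCGCCACGC TGGCGTTGGC GCTCTTCGTC"
bases = clean_sequence(first_row)
print(len(bases))                    # 60
print(round(gc_percent(bases), 1))   # GC percentage of this row only
```

Passing the full listing (all rows joined) to the same two functions would give the figure for the complete gene.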
 
Protein sequence
MKFKKAYLAA GATLALALFV SACGSDSGGS GGGGDDTITV AVAGPMTGDN GIYGQDQLSG 
VQFAAKEIND SGGIPDGPLK GKKIKVVKFD DVADPNQGAS VAQKICDDTS IMAVFGHSNS
SVTLAAEPIY ERCGVPLFVS YSSNPEITAE LHENLFRTLI DDAKMGSEMA SFSHDQLGFK
KVGVIASDDD YGDGLKTNFN KTAEEIGLDV AKTVTTSAKQ KDFTPQLTEL RNAGADSLVL
LNTYTDAALQ IKQADAMGWD VPIFVTPGSN SPELVKIAGE KAAEGTIVAA VFDPNSSEPG
PAKFVNDFTA ANGKGPGESA AMSYDSFYVF LTSLEKGAKD RKSVIEKSAE IGTFTLPIRG
ELMFNETHEP TVVPGKPAQI LLQVKDGQIG SYAG
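
The protein sequence can be related back to the gene sequence by translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons; since this gene starts with ATG, a plain standard-code lookup is enough here. The following is a rough sketch with hypothetical names, not the tool that produced this record.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and its final 15 bases, as printed above.
print(translate("ATGAAGTTCA AGAAGGCGTA CCTCGCGGCT GGCGCCACGC TGGCGTTGGC GCTCTTCGTC"))
# MKFKKAYLAAGATLALALFV
print(translate("AGCTACGCCG GCTGA"))
# SYAG
```

Both outputs match the start and end of the listed protein sequence.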