Gene Noca_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4103 
Symbol 
ID4596617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4334065 
End bp4335813 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content68% 
IMG OID639778709 
ProductABC transporter related 
Protein accessionYP_925287 
Protein GI119718322 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0411] ABC-type branched-chain amino acid transport systems, ATPase component
[COG4177] ABC-type branched-chain amino acid transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGGCAC TCTGGCCCCA GATCGCGGTG GGGTTCGGCC CCCAGCAGTC CTACTACGTC 
GACCTGGCCA CGACCGCCCT GATCCTGACC ATGCTGGCCA TCTCGCTCAA CCTGGCGATG
GGCATCGCCG GGCTGCTCTC GTTCGTCCAC ACAGGGTTGC TGGCCGTCGC CGGCTACACC
CTGGGCATCT CGGCGCTTCG ATGGGGAGTC AACCCCTGGC TCGGGATCCT GCTGGGTGTC
GTCGTCACTG TCCTGTTCTG CATCCTGATC AGCCTGGTAT CGGCTCGCGC CACCAACATC
TACTGGGGCC TGATCACGTT GAGCTTCGAC CTGGTGCTCA TCCAGGTGAT CGAGCAGTGG
GGGTCGGTGA CCGGCGGCTT CAACGGCCTG CCCGGCGTCC CGCGCCCGCA GCTGAACGGT
GCCCCTCTCG ATACCCAGTC GTTCTACTAC CTGTGCCTGG TGATGGCGAT CGGCTGCTAC
CTGGTCACGA GGAACCTGGT GAAGTCGCCG GCCGGACGAG GGTTCCACGC GCTCCGTCAG
TCTCCGGAGA CCGCTTCCAG CCTGGGCATC AACGTTCCCG CCGCACGGCT GCTCGCCTTC
GCCATCGCCG GCGCCCTGGC AGGCTTCTCG GGCGCGCTGT TCGCGATGCA CCTGAACTTC
ATCAGTCCCG AGGTCGCCGA CCAGGGGCCC CAGCTGCCAC TGTTCATCGC GATCTTCCTC
GGAGGGTTCG GGACCCTGCT GGGTCCCGTG CTCGGCATGG TCGTCGTCAC CGCCGTTCAG
CACGAGCTGC GTGGCTCGGG TGAGAACGCG CAGCTGATCC TGGGACTCTT CTTGTTGGCC
TGCATTCTCA TCCTCCCCAA GGGCATCGTC GGCACCTGGA TGTCGACCAG GTTCGGACGT
TGGGTCAGTG TCGCCGCGCA TCCAGCGGTG CCGCCACCGG ACGAGTCCGA GTCCCTGGCG
CTCCTGCAGA ACAGCACGAG TGGCGCCGAG CCGGTGATGA GGTGCCAGGA CGTCGGCAAG
ATGTATGGCG GAGTGCGCGC CCTGGGCGGT GTCGACCTGG AGCTTTACCC GGGCGAGATC
CACGCGCTCA TCGGACCCAA CGGCGCTGGC AAGTCGACCA TGGCCGGCTG CATCACCGCC
CACATCCAAC GTGACTCCGG CACCGTGGTG GTCGGTGGGA GGCCGTTGCC CAAGCGCCCG
CACGCGGTGG CCAAGCGTGG GGTGACCCGC GTCTTCCAGG AGCCGCACCT CTTCGAGGGC
GGAACCGTGG AGGACAACGT CCTCACCGGC ATGTACGGCT GGGGCCGATC GGGCTGGCTG
CCAGCCATCG TGCGGACTCC CGGGTATCTG CGTGCCGAGC GCGAGCGTCG CCAGCAGGTC
CGGCTCCTCT TGCGCGCGGT CGGGCTCGAG AGCCTGGTCC ACCTGTCCGC GTCGTCGTTG
TCGCATGGCC AGAAGCGGCT GGTCGAGGTC GTCCGCGCCG TCGCGACCCG TCCGACCGTC
CTCATCCTGG ACGAGCCCGC GACGGGACTG ACGCCGGGTG ACCTCGCGCA ACTGGACAGC
CTGCTGCGCA CGCTCAGGCA GTTCGGGATC GCGCTCCTCC TGATCGAGCA CAACATGGGC
TTCGTCCTCC CGCTGTCCGA CCGGGTCACG GTTCTGAACT TCGGTGAGGT GATCGCCCAC
GGCACGCCGC AAGAGATCGT CGACTCTCCC GTCGTCCGGG AGGCCTACCT GGGAGGCGCG
ACCGTATGA
 
Protein sequence
MLALWPQIAV GFGPQQSYYV DLATTALILT MLAISLNLAM GIAGLLSFVH TGLLAVAGYT 
LGISALRWGV NPWLGILLGV VVTVLFCILI SLVSARATNI YWGLITLSFD LVLIQVIEQW
GSVTGGFNGL PGVPRPQLNG APLDTQSFYY LCLVMAIGCY LVTRNLVKSP AGRGFHALRQ
SPETASSLGI NVPAARLLAF AIAGALAGFS GALFAMHLNF ISPEVADQGP QLPLFIAIFL
GGFGTLLGPV LGMVVVTAVQ HELRGSGENA QLILGLFLLA CILILPKGIV GTWMSTRFGR
WVSVAAHPAV PPPDESESLA LLQNSTSGAE PVMRCQDVGK MYGGVRALGG VDLELYPGEI
HALIGPNGAG KSTMAGCITA HIQRDSGTVV VGGRPLPKRP HAVAKRGVTR VFQEPHLFEG
GTVEDNVLTG MYGWGRSGWL PAIVRTPGYL RAERERRQQV RLLLRAVGLE SLVHLSASSL
SHGQKRLVEV VRAVATRPTV LILDEPATGL TPGDLAQLDS LLRTLRQFGI ALLLIEHNMG
FVLPLSDRVT VLNFGEVIAH GTPQEIVDSP VVREAYLGGA TV