Gene Noca_1296 details


Gene Information

Locus tag: Noca_1296
Symbol:
ID: 4598918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1370039
End bp: 1371259
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 72%
IMG OID: 639775890
Product: major facilitator transporter
Protein accession: YP_922497
Protein GI: 119715532
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
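
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the listed numbers but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp < 3 or gene_len_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1370039, 1371259)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both results agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.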


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.753687
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCTCGT TCCTCAAGCA GCCCCGCGCC GTCTGGGCCG TCGCCTTCGC CTGCGTCATC 
GGCTTCATGG GCATCGGGCT GGTCGACCCG ATCCTCAAGG AGATCGCGGC CGACCTCGGC
GCGACCGGCA GCCAGGTGTC GCTGCTGTTC AGCAGCTACA TGGCGGTGAT GGGCGTCGCG
ATGCTCGTGA CGGGCGTGGT CTCGAGCCGG ATCGGCGCGA AGCGGACCCT GCTCTCGGGG
CTGGTGCTCA TCGTCGTGTT CGCCGCACTG GCCGGCTCGT CGGGCTCCGT CGGCGCCGTG
ATCGGCTTCC GGGCCGGGTG GGGTCTCGGC AACGCGCTCT TCGTCGCGAC CGCGCTGGCG
ACGATCGTGC AGTCGTCGAA GGGCTCGGTC GCGCAGGCCG TCATCCTGTT CGAGGCCGCC
CTCGGCCTGG GCATCGCCTC CGGCCCGCTG GTCGGCGGCC TGCTCGGCGA GCAGTCCTGG
CGCGCGCCCT TCTACGGCGT GTCCACGCTG ATGACGATCG CGGCGGTCGC GACCGCGCTG
ATGCTCCCGG CCACGCCGCC TCAGGGTCGG CGTACGTCGC TCGCCGACCC GTTCCGGGCC
CTGCGCCACC CGGCACTGCT GCTGCTCGCG CTGGTCGCGG TCTGCTACAA CCTGGGGTTC
TTCACGCTGA TGGCCGCCGG ACCGTTCGCA CTGCCGAGCT ACGGGATCAT GGAGATCGGC
TGGACGTTCT TCGGCTGGGG TGTGCTGCTG GCGTTCACCT CCGTGGTCCT CGCGCCGTGG
CTGCAGCGGC GCTTCGGCAC GCTGCCCACG CTGACCGCGG TGCTGGCCCT GTTCGCGGCG
GTGCTCGCGG TGCTGGCGGT GAACTCCGGG AACGAGACCA CGATCACCGT CGGCATCGTG
GTGATCGGCG GGCTCCTCGG CATCAACAAC ACGCTGGTGA CCGAAGCGGT CATGGGCGCG
GCCCCGGTCG AGCGGGCGGT CGCCTCGGCG GCGTACTCCT TCGTCCGGTT CACCGGCGGT
GCGATCGGTC CGTACGTCGC GATGAAGCTC TTCGAGCGCC ACGGAGCGGC CGCGCCGTTC
TGGTTCGGCG CCGCGGCCGT CACCGTCGGG GTCGTGGTCA TCGCCGCCGG CAGCACCACG
ATCCGGCGGG CCCTGGTCGG CACGCCCGCG TCCCACTCCG CCGCCGAGGC CGAGGCGGAG
CTGCTCGGCG ACCTGGCCTG A
 
Protein sequence
MSSFLKQPRA VWAVAFACVI GFMGIGLVDP ILKEIAADLG ATGSQVSLLF SSYMAVMGVA 
MLVTGVVSSR IGAKRTLLSG LVLIVVFAAL AGSSGSVGAV IGFRAGWGLG NALFVATALA
TIVQSSKGSV AQAVILFEAA LGLGIASGPL VGGLLGEQSW RAPFYGVSTL MTIAAVATAL
MLPATPPQGR RTSLADPFRA LRHPALLLLA LVAVCYNLGF FTLMAAGPFA LPSYGIMEIG
WTFFGWGVLL AFTSVVLAPW LQRRFGTLPT LTAVLALFAA VLAVLAVNSG NETTITVGIV
VIGGLLGINN TLVTEAVMGA APVERAVASA AYSFVRFTGG AIGPYVAMKL FERHGAAAPF
WFGAAAVTVG VVVIAAGSTT IRRALVGTPA SHSAAEAEAE LLGDLA
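
The gene and protein sequences above can be cross-checked with a short script. The sketch below is an illustration, not the database's own method: it strips the spacing used in the listing, computes GC content, and translates a coding sequence with the standard codon assignments that translation table 11 shares. It assumes the input is a complete CDS beginning at the initiation codon, so the first codon (GTG here) is read as M. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for elongation).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100 * sum(base in "GC" for base in dna) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is read as M (assumption)."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = ["M"]  # initiation codon, whatever its usual meaning
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAGCTCGT TCCTCAAGCA GCCCCGCGCC"))  # MSSFLKQPRA
```

Running `translate_cds` and `gc_percent` on the full gene sequence allows a comparison with the listed protein sequence and the GC content field.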