Gene Nham_3503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_3503 
Symbol 
ID4029434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp3850507 
End bp3851817 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content64% 
IMG OID637971915 
Productmajor facilitator transporter 
Protein accessionYP_578692 
Protein GI92118963 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.993279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTGA TGGCCGAGCC GCCGTGTGCG TCGGCATCGG CAAATGGACG TTTTTCGATG 
GCGACAACCG AAGAGAGCGT TCGGCGTCTC GGGCGCGAAG CGGGATGGCG CACGCCGCTG
GTGATCATCG TCTGCGGCTG CCTGATCGGG ATGCTGACGT TCGGGCCGCG TTCGGCCTTC
GGCTTTTTCA TGCAACCCAT GAGCAGCGAG TTTTCGTGGG GCCGCGACGT TTTCGCGCTG
GCTTTCGCGG TTCAGAATCT GCTGTGGGGA ATCGGCCAAC CCTTCGCCGG CGCGATTGCC
GACAGGTTCG GCAGCGTGCG CGTGATTTGC ATCGGCGCGT TGATGTATGC CGCCGGCCTG
TTGGTGATGC GCTATGCGGG CACGCCGTTG TCGCTCAATA TCGGCGCCGG ATTTTTGATC
GGTTTCGGCC TGTCCGGCTG TTCGTTCAAT CTGGTGCTGT CGGCGTTCGG CAAGCTGTTG
CCGGAGCAAT GGCGTGGCAT CGCGCTGGGC GCCGGCACCG CGGCGGGATC GTTCGGACAA
TTCGTGTTCG CGCCGTTCAG CGTTGCGCTG ATCGACAATT TCGGCTGGCA GCCGGCGCTG
ATCGTGTTTG CGGTGCTCAT GCTGTTTGTG GTGCCGTTTG CGCTCGTGTT GTCGACTCCG
CCAAGCGATA CCGGCAGTAC GGCTGCCGCC GCACCGGAAC AATCTTTCAA GAGCGCGCTG
GCGGAAGCCT TCGGCCACCG CTCCTATGTC CTGCTGGTGC TCGGCTTCTT CACCTGTGGA
TTCCAGCTCG CCTTCATCAC CGCGCACCTG CCGGCCTATC TGGTCGATCG CGGCCTGTCG
GTTCAGACCG GCGGATGGGT GCTGGCGGCG ATCGGCCTGT TCAACATCAT CGGCTCGCTG
AGCGTCGGAT GGCTCTCGAC CAGGATGCCC AAGCGCTACA TCCTGTCGGC AATCTACTTC
ATCCGCGCGC TGTCGATCGT GGTCTTCATT TCAACCCCGA TGACGACATT CTCCGCGGTC
GCGTTCGGCG TCGTCACCGG CCTGACCTGG CTGTCCACGG TGCCGCCAAC CACGAGTCTC
GTGGCGCTGA TGTTCGGCAC CCGCTGGCTC GCGACGCTCT ACGGCTTCGC GTTCTTCAGC
CATCAGGTCG GCGGGTTTCT CGGCTCGCTG CTGGGCGGCG TAGTGTTCGA CCATTTCGGT
TCCTACACCC CGGTCTGGTG GTTGTCGGTG CTGTTCGGCG TGCTGTCCGC ATTGATCAAT
CTGCCGATCG TCGAAGCCCC GGTCCGGCGG GCGGTTGCGC AGCCTGCATA A
 
Protein sequence
MALMAEPPCA SASANGRFSM ATTEESVRRL GREAGWRTPL VIIVCGCLIG MLTFGPRSAF 
GFFMQPMSSE FSWGRDVFAL AFAVQNLLWG IGQPFAGAIA DRFGSVRVIC IGALMYAAGL
LVMRYAGTPL SLNIGAGFLI GFGLSGCSFN LVLSAFGKLL PEQWRGIALG AGTAAGSFGQ
FVFAPFSVAL IDNFGWQPAL IVFAVLMLFV VPFALVLSTP PSDTGSTAAA APEQSFKSAL
AEAFGHRSYV LLVLGFFTCG FQLAFITAHL PAYLVDRGLS VQTGGWVLAA IGLFNIIGSL
SVGWLSTRMP KRYILSAIYF IRALSIVVFI STPMTTFSAV AFGVVTGLTW LSTVPPTTSL
VALMFGTRWL ATLYGFAFFS HQVGGFLGSL LGGVVFDHFG SYTPVWWLSV LFGVLSALIN
LPIVEAPVRR AVAQPA