Gene Saro_2566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2566 
Symbol 
ID3916888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2768850 
End bp2769950 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content66% 
IMG OID640445324 
ProductTrkA-N 
Protein accessionYP_497836 
Protein GI87200579 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0569] K+ transport systems, NAD-binding component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.353731 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCGG ACGGCGAACT TTCGGCGCTG ACGCAGGGCG CCAACTCTGT ACTGGGGTCG 
CCGCTGCGCC TGCTGACGGG CACGCTGATC TACGAACTGG CAGTCTATGT CGTGTCCACC
TGCGGTTTCA TGTTCGCAGG CTGGCCGCTG GGCGATGCGT CCTACATGGT GCTGCTGACG
ATCTTCTCGG TCGGCTATGG CGAGATCCGC CCGGTTGACA CCCTCTTCCT GCGCTGGTGG
ACGAGCGGGA CGATCGTCCT GGGCTGTACC GGGCTGATCG TGCTGACCGG CGCGCTGGTG
CAGGTCTTCA CCCTGTTTCA ATTCCGACGC CTGCTGGGGC TGGACCGTAT GATGACCGAA
ATCGAGAAGC TGGATGGCCA CGTAATCATC TGCGGCTATG GCCGGATCGG CGTGCAGCTC
GCCCGGGCGA TGACCGAGGC GCGGCGGCCG TTCCTGATCC TGGAACGAGA CCACGCCAAG
GCCGACGAGG CCAAGACGCA AGGCTTCCTG TGCATGGTCG GCGAGGCCAC GCACGAGGAG
ACGCTGAAAG CGGCGGGGAT CGCGCGGGCC AAGGTGCTGG CGACGGTGCT GCCCGACGAC
GCGGCCAACG TGTTCATCAC GCTCTCGGCG CGCAATCTCA ACCCCGGCAT CGAGATCATC
GCGCGCGGCG AAGCGCCGAC CACCGAGAAC AAGCTGTTCC ACGCGGGCGC CGACAAGGTG
GTCATGCCCA CGCACATCGG CGCCGAACGG ATCGTCGAGA TGATCCTCTA CCCGGCAACC
GGCGAAGCGC TGGCGCAGAT CGGCGCAGTG AAGCGCAACC TCCACGACTT CGGCCTCGAC
CTCGAGGTCG TGGAACTCCT GCCCGACGGT GCGCTGACCG GAGAGACGGT GGGAGAGGCC
GAGCGGCGCG GCGATGGCGC GTTCTTTGTC GTGCAGATCG ACCGAACCAA CGGTCAGTCC
ATCGAACACC CCGGCGAGGA TGTGACGCTG GAGGCGGGCG ACAAGGTCAT GCTGGTGGTG
CGCGGCAGCA GATTGTCCGC CGGCGCGGTG TTCAGCGCGA CAAGGAAGCC GGTCAAGCGC
GGGCGGGCCT TTTCGGGATA G
 
Protein sequence
MEPDGELSAL TQGANSVLGS PLRLLTGTLI YELAVYVVST CGFMFAGWPL GDASYMVLLT 
IFSVGYGEIR PVDTLFLRWW TSGTIVLGCT GLIVLTGALV QVFTLFQFRR LLGLDRMMTE
IEKLDGHVII CGYGRIGVQL ARAMTEARRP FLILERDHAK ADEAKTQGFL CMVGEATHEE
TLKAAGIARA KVLATVLPDD AANVFITLSA RNLNPGIEII ARGEAPTTEN KLFHAGADKV
VMPTHIGAER IVEMILYPAT GEALAQIGAV KRNLHDFGLD LEVVELLPDG ALTGETVGEA
ERRGDGAFFV VQIDRTNGQS IEHPGEDVTL EAGDKVMLVV RGSRLSAGAV FSATRKPVKR
GRAFSG