Gene Noc_1225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1225 
Symbol 
ID3706415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1341340 
End bp1342308 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content51% 
IMG OID637737727 
ProductApbE-like lipoprotein 
Protein accessionYP_343256 
Protein GI77164731 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.65797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTTT ACCGGGAAGC CTTCAAGGCG ATGGGTACGC CCTGTGAAAT TCAGCTCTAT 
GCCAAGACTA AGGTTCAAGC TCAACGGGCC GCGGGACTTA TTATCGCCGA TGTGCGCCGG
TTGGAAGCTC GCTATTCCCG CTACCGCACT GATAGTTTTC TCTCCGATAT CAACCGGGTG
GCCATGGCGG GTGGGCGTAT TTCGGTAGAC AAAGAAACAG AAGGATTGCT CAATTATGCA
GCTACCTGTT ATGGGCAAAG TAAAGGATTA TTTGATATTA CTTCCGGTAT TTTACGGCGC
GCTTGGGATT TTAAATCAGG CGAGCTGCCG AGCAGGGTGC AAGTACAGGA ACTGCTGGAC
AAAATTGGCT GGGAAAAACT GCGCTGGGCG CCGCCGGTAT TAGAATTTCC CATAGCTGGA
ATGGAGATTG ACTTTGGAGG TATCGTCAAG GAATATGCCG CCGACCGGGC GGCTTCGCTG
AGCTTACAGG CAGGAGTTCG GCGAGGCCTA GTAAACTTAG GCGGTGATAT CAAAGTGATT
GGTCCTCGCC CGGACGGTGA GCCTTGGCGG ATTGGTATCC GCCATCCTGG ACATAAAGAA
GCCGCGATGG GAATGCTGCT GCTACATAAA GGCGCGATTG CTAGTAGTGG TGACTATGAG
CGCTGCATTG TATTAGATGG CGTTCGTTAC GGCCATGTCC TCAATCCTCA GACGGGTTGG
CCGGTACGGC ACTTGGCATC GGTGACCGTA ATCAGTGATT TATGTGTGGT AGCAGGAAGC
GCTTCTACTA TTGCCATGCT AAAAGAGGCT AATGGTTCAG CCTGGTTGCA GAATTTGGGA
TTACCTCATT TCTGGGTGAA CATGGAAGGT GAAACCGGTG GTTCCCTAGA GGAAGGCTCC
ATTGAAGCTG TTGCTATGGT TCAAAACAGC GAGCGCCAGG CGGCAATCAC GGTACTCAAC
GAATATTAA
 
Protein sequence
MKLYREAFKA MGTPCEIQLY AKTKVQAQRA AGLIIADVRR LEARYSRYRT DSFLSDINRV 
AMAGGRISVD KETEGLLNYA ATCYGQSKGL FDITSGILRR AWDFKSGELP SRVQVQELLD
KIGWEKLRWA PPVLEFPIAG MEIDFGGIVK EYAADRAASL SLQAGVRRGL VNLGGDIKVI
GPRPDGEPWR IGIRHPGHKE AAMGMLLLHK GAIASSGDYE RCIVLDGVRY GHVLNPQTGW
PVRHLASVTV ISDLCVVAGS ASTIAMLKEA NGSAWLQNLG LPHFWVNMEG ETGGSLEEGS
IEAVAMVQNS ERQAAITVLN EY