Gene Noc_2765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2765 
Symbol 
ID3705303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3135929 
End bp3138184 
Gene Length2256 bp 
Protein Length751 aa 
Translation table11 
GC content58% 
IMG OID637739241 
ProductTPR repeat-containing protein 
Protein accessionYP_344742 
Protein GI77166217 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCCGA CCTCTCGATA CGGCCCGCAC CCCAACCGGC CCGCGGCGAA CAAGCAGTTT 
ACCAATCGGG ATGGGCCGAT TAAAGCCTTT CAAGCGGCCC GCCAGGCCCT CGCACCCGAC
CAGCATGATG TGCTGACGTT CCATGGCGTT GGCGGGCAGG GCAAAACAGC CCTGGGGAAG
AAATTTAGGC AAATCCTGGA GGAAGAGACG CCCCGTGCCG CGCTTTGGGG CCACGTGGAT
TTAGCGGACA CGCACCTGCC TACGCCCGAC CGGATTCTTC TCGAATTACG CCGCTCGCTC
CGGCGCTCGG GAAAAATAGA GTTTCCCGCT TTCGATGTCG CCTTTGCCCA TTACTGGCAG
CAAGCTTACC CGCATTTAAA ATTTCAGGAC ACCCATCGGG ATTTGCTCAC CGATAAGGAA
GGGGTTTTTG CCCACGCTCT GGAGTCGGCG AGTAGCTACG CGGAAGCCTT ACCGGCGGGC
ATTGGCCTTA CCATGATAGC GCTGAATTGG GTGCGCCAAC GAATGCGCGA TTTTGATAAC
CGCTATCGTC CCGCCCTCAG GGGTTTGGAA ACCTTCAGCC CCACGGAGAT TCTCGACCGC
CTTCCTTATT ACCTCGGCCT CGATTTATGC GGCTATCAGC GCCGGCAAAG GGCAAAACAG
CTCATTGTTT TTTTAGATAC CTATGACGCC TTATGGGAAA CCTTCACCCA GGGCGCCCGC
TGGGAGGTGG ATGCCTGGGC GCGGGAGCTG GTCGCGAGCG CCCCCGGCGT GCTGTTTGTG
GTCTTCGCAA GAGAGAAACT CACCTGGGAT ACCCATGAAC CCGATGCGTG GCGGGGATGT
CTTCACCATC AGCCCCCCCT TGCGCCGCTT ACTGATGAGG ATGCCGACAC GTTTTTGCGC
CAAATTCCGA TAGAAGAGGA GGCCGTGCGC CAGACCATTA TCCAAGCGGC CGAGGGCCAT
CCGTTTTATC TCGACCTGGA AGTGGAGCAC TATCTCGATT TAGTGGCTGA CGGCGCAACA
CCTCAACCGG AGCACTTTGG CCAGACAAAG CAAGCGATTC TGGCGCGATT TTTACGCTAT
CGGGCAGGCG AAGAACAAGC CACCCTCAAG GTGTTATCGG TGGTCCGCTC CTTTGACTTT
GAGCTATTGG AAGCGCTCGT CCGAGCGTTC CAGACGGGCT TTCCTCCCAC TGAATTCTTT
AATTTTGTAC GCTACTCTTT TGTAGCGCCA GAAAGCGGCG AGCGGTATCG CTTGCATTCG
CTGATGCAAG CGCATCTTCT GGAGAGCTTG GAGCCAGCGC TTCAAAAAAA GATCCATGAC
TTTGCGTTTA CCTATTATGA CGCGCGATGC CAACCCGAAT CCCCCAAAGA CCTTCAGCCA
GCCCATGAAA TCGCGTTAGT CGAAGCTTTT TACCACCGGG ATATGACCGA TCCCCAGGAG
GCGGCCGCAT GGATTAACCA ACGGACAACT GTCTTTTACG AAGCGGCCCG GTATTCGTTA
GTCGAACCCC TCTACCAACG GGCCTTGGCT ATTCGTGAAC AGATCCTCGG CCCCGACCAT
CCCGATACTG CCACCAGCCT CAATAACCTG GCGGAATTGT ATCGCGCCCA GGGGCGCTAT
GCCGAGGCCG AGCCCCTCTA TCAACGGGCC TTGGCTATTT GTGAACAGGT CCTCGGCCCC
GACCATCCCG ACACTGCTCG AAGCCTCAAT AACCTGGCGG GGCTGTACAA GGCCCAGGGA
GACTATGGGC AGGCCGAGCC CCTCTACCGG CGGGCCTTGG CTATTTGTGA ACAGGCCCTC
GGCCCCGACC ATCCCCACAC TGCCACCAGC CTCAATAACC TGGCGGGGCT GTACGATAGT
CAGGGACACT ATGGGCAGGC CAAGCCCCTC TATCAACGGG CCTTGGCTAT TTATGAAAAA
ACCCTCGGTC CCGACCATCC CCGCACCGCC ACCAGCCTCA ATAACCTGGC GGCGCTGTAC
GATACGCAGG GAGATTATGC GCGGATCGAG CCCCTCTATC AACGGGCCTT GGTTATTCAT
GAAAAAACCT GCGGTCCCGA CCATCCCCGC ACCGCCACCA GCCTCAATAA TCTCGCGGGG
CTGTATAAGG ATCAGGGAAG CTATGCCCAG GCCGAGCCCT TTTATCAACG GGCCTTGAGT
ATTTGTGAAA AAAACCTCGG CCCCGACCAT CCCGATACCC ATACGGTGCG TCAAAATTAT
CAAGCGCTGC TTGCCGCCAT GGGGCAGCAG AAATAG
 
Protein sequence
MPPTSRYGPH PNRPAANKQF TNRDGPIKAF QAARQALAPD QHDVLTFHGV GGQGKTALGK 
KFRQILEEET PRAALWGHVD LADTHLPTPD RILLELRRSL RRSGKIEFPA FDVAFAHYWQ
QAYPHLKFQD THRDLLTDKE GVFAHALESA SSYAEALPAG IGLTMIALNW VRQRMRDFDN
RYRPALRGLE TFSPTEILDR LPYYLGLDLC GYQRRQRAKQ LIVFLDTYDA LWETFTQGAR
WEVDAWAREL VASAPGVLFV VFAREKLTWD THEPDAWRGC LHHQPPLAPL TDEDADTFLR
QIPIEEEAVR QTIIQAAEGH PFYLDLEVEH YLDLVADGAT PQPEHFGQTK QAILARFLRY
RAGEEQATLK VLSVVRSFDF ELLEALVRAF QTGFPPTEFF NFVRYSFVAP ESGERYRLHS
LMQAHLLESL EPALQKKIHD FAFTYYDARC QPESPKDLQP AHEIALVEAF YHRDMTDPQE
AAAWINQRTT VFYEAARYSL VEPLYQRALA IREQILGPDH PDTATSLNNL AELYRAQGRY
AEAEPLYQRA LAICEQVLGP DHPDTARSLN NLAGLYKAQG DYGQAEPLYR RALAICEQAL
GPDHPHTATS LNNLAGLYDS QGHYGQAKPL YQRALAIYEK TLGPDHPRTA TSLNNLAALY
DTQGDYARIE PLYQRALVIH EKTCGPDHPR TATSLNNLAG LYKDQGSYAQ AEPFYQRALS
ICEKNLGPDH PDTHTVRQNY QALLAAMGQQ K