Gene Rpal_4454 details

Gene Information

Locus tag: Rpal_4454
Symbol:
ID: 6412138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4785764
End bp: 4787635
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 62%
IMG OID: 642714336
Product: flagellar hook-associated protein FlgK
Protein accession: YP_001993425
Protein GI: 192292820
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
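
The start, end, gene length, and protein length fields above agree with one another. A minimal Python sketch of that check follows. It assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4785764, 4787635)
print(length, protein_length(length))  # prints: 1872 623
```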


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCTCG GAGACGCACT CGCCATCGCA ATGGCCGGCC TTCGGGTGAA CCAGGCGACA 
ATGTCGCTGG TGTCATCCAA CGTTGCCAAC GCCGAGACGC CCGGCTATGT CCGCAAGACG
GTCAACCAGT CGGCGGTGAA TGCGGGGGAG ACCGGCACCA GCGTCAAGGT CGTCGGCGTC
GATCGTCAGC TCGACGACTA CATCCTGGCG CAGCTTCGAA CCGAGACTTC GGGAGGGGCC
TACGCGTCGT TGCGATCGGA CTTCCTGAAG CAGCTGCAGG GGCTATTCGG CGATCCGAAC
TCCAGCGGGA CGCTCGAAAA TTCCTACAAC GGCCTCACCA CCGCGCTGCA GGCGCTCGCC
ACCAGTCCGG ACAGCACGTC TGCACGGATT GCTGTGCTGA ATGCCGCGCA GGCGATTTCG
AGCAATCTGA ATTCGACATC GAACGGCATT CAAACGCTGC GCGCAGGGTG CGAAACCGGC
ATCGCCGACG CGGTGACCAC CGCGAACAAC TTGATGCAGC AGATCGCCAA CATCAACACC
CATATCCAGA CCAATCCGCT CGGTGGCACA TCGACCGATG CGGCCACCTC GGCGATGCTC
GATCAGCGCG ATCAGGCGAT CAATCAGTTG TCGCAACTGA TGGATATCCG CGTCGTCACC
AATGGCGCCA ATCAGGTGAC CGTGTTCACC GGATCCGGCG TGCAGCTCGT CGGCATGGAA
GCGGCGAAGC TGACCTTCAA TGCGCAGGGC ACGGTTTCGC CGACCACGAT GTGGAATCCG
AACACGGCGA TCAGCGATCT CGGCTCGATC CGGGTCGGCT ACGCCAACGG CTCCTCTTCC
GATCTGACCA GCTCGCTGAA ATCCGGTAAG TTGGCCGCCT ATGTCGAGTT GCGCGACAAG
ACCCTTGTCC AGGCCCAGAC CCAGCTCGAT CAGTTTGCGG CCTCGCTGTC GAGCGCGCTT
TCAGACAAGA CGACGGCCGG CACGGCTGTC ACCTCCGGCA CCCAGGCGGG TTTCAGTCTC
GATCTGTCGT CGATGAAGAC CGGAAACACC GTCAACATCT CTTACACCGA TACGCTGACC
GGTGCGCAGC ATACGGTGAC GGTGGTGCGG GTCGATGATC CGTCGGTGCT GCCACTGCCG
CAGAACGCCA CCGCCGATCC GAACGACTAC GCGGTCGGGA TCGATTTCTC CGGCCTGTCC
GGCTCGGTGG TGTCGCAGCT CAACGCTGCG CTCAACAGCC GTAACCTGCA GTTCAGCGGC
ACCGCGCCAA ACATCACGGT GCTGAACAAC GCAGGCTTCT CGACGATTAC CGCGGCCTCA
GTCACCAGCA CCGAGACGTC GCTGACCGGC GGCACCGCAT CGGTACCGCT GTTCAATGAC
AACGGCGCGG CTTACACCGG GGCGATCAAC GGCTACGGCA GCCAGATGAC TGGCTACGCG
CAGCGGATCT CGGTCAACGC CGACCTGATC AAGGATAACT CCCGGCTGGT GGTGTATTCG
ACCTCGCCGC TCACGGCCGC GGGCGATACC ACGCGGCCGG ACTTCCTGGT CAAGCAGCTC
AACACCAGCA AGTATCTGTA TTCGGCGAAG ACCGGCATCG GTTCGGACGC TGCACCTTAC
AAGGGCACGC TGCTGAGCTA TCTGCAGCAG TTCGTCAGCC AGCAGGGCTC CAATGCGGCG
GCGGCGCAAC AACTGTCCGA GGGGCAGAAC GTGGTGCTGA ATACGCTGCA GCAGAAATTC
TCGACGTCCT CCGGCGTCAA CATGGACGAG GAGATGGCGC ATCTGCTGTC GCTGCAGAAC
GCCTACGCGG CGAACGCGCG GGTGATGTCG ACCATCAACC AGATGTATCA GTCCCTGATG
CAGGCGATTT GA
 
Protein sequence
MSLGDALAIA MAGLRVNQAT MSLVSSNVAN AETPGYVRKT VNQSAVNAGE TGTSVKVVGV 
DRQLDDYILA QLRTETSGGA YASLRSDFLK QLQGLFGDPN SSGTLENSYN GLTTALQALA
TSPDSTSARI AVLNAAQAIS SNLNSTSNGI QTLRAGCETG IADAVTTANN LMQQIANINT
HIQTNPLGGT STDAATSAML DQRDQAINQL SQLMDIRVVT NGANQVTVFT GSGVQLVGME
AAKLTFNAQG TVSPTTMWNP NTAISDLGSI RVGYANGSSS DLTSSLKSGK LAAYVELRDK
TLVQAQTQLD QFAASLSSAL SDKTTAGTAV TSGTQAGFSL DLSSMKTGNT VNISYTDTLT
GAQHTVTVVR VDDPSVLPLP QNATADPNDY AVGIDFSGLS GSVVSQLNAA LNSRNLQFSG
TAPNITVLNN AGFSTITAAS VTSTETSLTG GTASVPLFND NGAAYTGAIN GYGSQMTGYA
QRISVNADLI KDNSRLVVYS TSPLTAAGDT TRPDFLVKQL NTSKYLYSAK TGIGSDAAPY
KGTLLSYLQQ FVSQQGSNAA AAQQLSEGQN VVLNTLQQKF STSSGVNMDE EMAHLLSLQN
AYAANARVMS TINQMYQSLM QAI
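
The protein sequence corresponds to the gene sequence under translation table 11, which this hedged Python sketch illustrates. Table 11 assigns amino acids to codons in the same way as the standard code and differs in which start codons it permits. That does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAGTCTCG GAGACGCACT CGCCATCGCA ATGGCCGGCC TTCGGGTGAA CCAGGCGACA"
print(translate(first_line))  # prints: MSLGDALAIAMAGLRVNQAT
```

The output matches the first 20 residues of the protein sequence above (MSLGDALAIA MAGLRVNQAT).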