Gene Francci3_4330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4330 
Symbol 
ID3907300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5172263 
End bp5173813 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content72% 
IMG OID637881659 
ProductATP-binding region, ATPase-like 
Protein accessionYP_483405 
Protein GI86743005 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA AGCGACCGTC ACCTCGTGGC TCGGAAGGCG ACTTCTCCGT CGTCTACAGT 
GCTCTGGTAG CGGCAGCCCT GACCGCACTC TGTACAGTGG CCGCCATCCT TCTCGTCGCC
CCGCACGCCC GGCGCGCTGT TGCCGTGTGC GGCCTCGTCG CGGCGGTCAC GGTCGGCCTG
GCCTGCGCCG AGATCCGCCG CCGCGGGCAC GAGATCGTCC ATCTGCGCCG TCGGCAGGCC
GCCACCGAGC AGGAACTCAT GCACCGCCTG GAGATACAGG AGGCCGAGAC CAGCCGCCTC
GCCGTCGAGG TGCTGCCCGC CGCGATCAGT CGGCTGCACC GCGGCGACTC GGTCGAGGAG
GCGCTGTCCG GCTCGAGCCA GTCGCCCGCC CTCGGCGCCG CGTTCATCAA GGCGCACGAC
GACGTGGTCC GCTCGGTCCT CGAAGCGGTC AAGGCGGAAG AGGACCTGCG CGACTCCTCG
CAGCGCGCGT TCGTCAACGT CGCCCGTCGC GTCCAGGCCA TCGTGCACCA GCAGTTCCAG
GATCTGCGCG AGATGGAGGA GCGGCACGGC GGGAACCCCG ATGTCTTCGA CGACCTGCTC
CGCCTCGACC ACGGCACCGC GCTGATCGGC CGGCTCGCCG ACTCGCTCGC CGTGCTCGGC
GGATCCCGCC CCGGCCGGCA GTGGCAGCAT CCGGTGCCGC TGTTCAACGT GCTGCGCGGC
GCGATGTCCC GGATCATCGG CTACCAACGC GTCGAACTGC ACTCGGTGGT CGAGGTCGCC
ATCTCAGGAC CCACCGTCGA ACCGCTCATC CACGCGCTCG CCGAGCTGCT GGACAACGCC
ACCCGGTACT CGCCGCCGCA GACCCGCGTG CACATGACCG CGGTCGAGGT GCAGTCCGGG
ATCGCGGTCG AGATCGAAGA CGGCGGCGTC GGCCTCGGCG AGGAGGCCCG GCTGCGCGCC
GAGCGGGTGC TCGCCGAGGT CAAGTCGGGC CTCGACCTCG CTGACCTCGG CGGGAACCCG
CGCCTCGGCC TGTCCGTGGT CGGTCGGCTC GCGCAGACCC ACCGCTTTCA GGTCTCGCTG
CGGCCCTCGG CGTACGGCGG CGTGCGCGCC GTGCTGATCA TCCCACGTGA CCTGGCCACC
GCCGTGCCGC TGTCCGCGGC CAGCCCGGGC GCCGCGGTCC GGCCGAACCC GAACACGCCG
ATGCCCGCCA TCGCCCTGGA GTCGGAGGAG CCGCAGCGCA TAAAGATCAA CAACAACGGC
CTGCCGCAGC GCAGGCGCGG CGTCACCGCC CCGCTCAACG GGAGCGGCAG CCAGCGGAGC
CTGACCGCGC CGCGCGGGCC GCAGTCCGCC GCCCCGCCCC CGCCTGTGTC GAGTAACGCA
TCCGCTCCCG CCGCCCCGCC CCCGCCCGTG TCGAGTAACG CATCCGCTCC CGCCGCCCCG
CCCCCGCCGC CCGGGCTGTG GCTTGCCGCG TTCCAGAACG CGGTCTCCGG CGAAAGCACG
AGCACCCAGC ACGCTTTGAC TGACGATGCG TCGAGTAAGG AAAGCGAGTA G
 
Protein sequence
MKRKRPSPRG SEGDFSVVYS ALVAAALTAL CTVAAILLVA PHARRAVAVC GLVAAVTVGL 
ACAEIRRRGH EIVHLRRRQA ATEQELMHRL EIQEAETSRL AVEVLPAAIS RLHRGDSVEE
ALSGSSQSPA LGAAFIKAHD DVVRSVLEAV KAEEDLRDSS QRAFVNVARR VQAIVHQQFQ
DLREMEERHG GNPDVFDDLL RLDHGTALIG RLADSLAVLG GSRPGRQWQH PVPLFNVLRG
AMSRIIGYQR VELHSVVEVA ISGPTVEPLI HALAELLDNA TRYSPPQTRV HMTAVEVQSG
IAVEIEDGGV GLGEEARLRA ERVLAEVKSG LDLADLGGNP RLGLSVVGRL AQTHRFQVSL
RPSAYGGVRA VLIIPRDLAT AVPLSAASPG AAVRPNPNTP MPAIALESEE PQRIKINNNG
LPQRRRGVTA PLNGSGSQRS LTAPRGPQSA APPPPVSSNA SAPAAPPPPV SSNASAPAAP
PPPPGLWLAA FQNAVSGEST STQHALTDDA SSKESE