Gene Francci3_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1446 
Symbol 
ID3903178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1736879 
End bp1738819 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content73% 
IMG OID637878783 
Productserine/threonine protein kinase 
Protein accessionYP_480552 
Protein GI86740152 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0259009 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCGA TGTCGCTGCG CTCTGGCGAT CCGGAGCGGA TCGCCCGGTT CACGCTCACC 
GCGCGGCTCG GCTCCGGCGG TATGGGAGTC GTCTACCTCG GCATCGACGA CGAGACCGGA
GGGCCGGTCG CCCTCAAGGT GATCCGGTCG GACTTCACGG CCGATCCCGA GTTCCGCAGC
CGGTTCCGTC GCGAGGTCGC GGCCGCCCGG GCGGTCGACG GTGCCTGCAC CGCGAGGCTG
GTCGATGCCG ATCCCGACGC CGAGGATCCG TGGATGGCAA CCGAGCACAT CCACGGCCAG
AGCCTCGCCG AGGCGATCGC CGACCGGGGC GCGCTGGCCA TGCCGGTCGT CATGGCGCTG
GCGACCGGGC TCGCCGAGGC CCTGAAGTCC ATCCACGACG CCGGCATCGT CCATCGGGAC
CTGAAACCGG GCAACGTGAT CCTCAGCGAG GACGGCCCGA AAGTGATCGA CTTCGGCATC
GCGGCGGCGG TCGACGCGAC GGCCGCTACC AGGACGGGGG TGTTGCTGGG CAGCCCGGGT
TACATGGCCC CGGAGCAGGT CACTGGACGC GGCGAGATCG GCCCGCCGGC GGACGTCTTC
GCCTGGGGCC TGACGGTACT GTTTGCCGCG TCGGGGCGAC CCCCCTTCGG CGCCGGCCGG
CCCGACGCAC TGCTGTACCG GGTGGTCCAC GACGAGGCCG ACACCGGGGA TGTTCCGCCG
GCCCTGCGCC CAGCGGTCCG CGCGGCGCTG CTCAAGGAGC CCATGGCCCG GCCGAGCGCC
CACGCGTTGC TGCGCGTGCT GATCGGGTCT ACTGGCGACC CGGGGCGCGA GACACGCCGG
ATACTGCGGG ACGCCTGGCT CGCTCCACCG GTGGCGACCC GTGTGAGGAG CCTTCTGCCG
GCCGTGGAGG AGGATACCCA CGACGCCAAG ACGCAGATCC GCCTCGATGC GCCGGAGAGC
GCCGGCGCGG GGACAAACGG CGCGGGGACA GCGGACCGGG CGCCACTCGG GACCGGTGGT
CCGGCGGGCG AGTCCGGGCA GGTTGGCGAG TCCGGGCAGG TTGGCGAGTC CGGGCAGGTT
GGCGAGCTTG GGCGGGTCCG CGGGTCCGCG CCCGGCGAGG ATGACGGTCC GGCCGGTGGC
CGGCCGACTG GAGATGATCA TGATGTGGCC GCTTCCGCCG AGGTAGCTTC CGCCGGGGCG
AAGCGGCCCC TGTCCGGCCG GCGGGCCCGC GCCCGACGCC ACCCGCGCCG GACCGCGGGT
CTGGCCGCCC TGGCCGCGAC ACTCGCGGGC CTGGCCGCCC TGAGCGCATC CGAGGCCGCC
AGCGACCTCG GTCCGCGGGC CCCCGGGACG GCCGCCAGCT CCCGGGCGAC GGGCGGGGAA
CTGGAGCCCG GCCCGCACGA CGTGCCACCC ACCCCGTCGT CGCGTGACGA CGTCCCGGCC
CGATCGGGAC GTACTCCTGG GCCACTGGGG CTCCCCTCAC CCTCTGTCAT CGTTCCTCCC
GCCGGTTTGT CGTCGGGGCG GCCGATCCCC GGGTTCCGGG GCACGGTCGG CCAGCTCACA
CAGGCCAAGG CGTTCACCGA CTTCGTCGCC GCCCACGACA CCCAGATCAT CTTCCTCGAC
ATCTCCACCC TCGCCGAGGG TAACGAAGGG GCCTTCTACC CCGGACCGGA CTTTGGGGGG
GACAACCGGC CCAACTTCAC CCTCTTCGAC GCGTGCGGGG CGCTCGGGCC GGGGGAGCCG
CCCGGCTTCG AGCCGGGGAA GGAGTGCTTC GGGAGCACGT ACCGCCTTGC CGAGATGGCG
CACAGCGGTG CTTCCTTCGG CTACGTGCAG GGTTCCTACC GGCTGCGTGG TTATTTCTGG
GTCGACCTGG TACCCGGAAT GCACCAGGGA TTCGCGAACA TCAATCTGCG GGCCGTCGAC
GTCCGGGATC TTCCGCGTTG A
 
Protein sequence
MPAMSLRSGD PERIARFTLT ARLGSGGMGV VYLGIDDETG GPVALKVIRS DFTADPEFRS 
RFRREVAAAR AVDGACTARL VDADPDAEDP WMATEHIHGQ SLAEAIADRG ALAMPVVMAL
ATGLAEALKS IHDAGIVHRD LKPGNVILSE DGPKVIDFGI AAAVDATAAT RTGVLLGSPG
YMAPEQVTGR GEIGPPADVF AWGLTVLFAA SGRPPFGAGR PDALLYRVVH DEADTGDVPP
ALRPAVRAAL LKEPMARPSA HALLRVLIGS TGDPGRETRR ILRDAWLAPP VATRVRSLLP
AVEEDTHDAK TQIRLDAPES AGAGTNGAGT ADRAPLGTGG PAGESGQVGE SGQVGESGQV
GELGRVRGSA PGEDDGPAGG RPTGDDHDVA ASAEVASAGA KRPLSGRRAR ARRHPRRTAG
LAALAATLAG LAALSASEAA SDLGPRAPGT AASSRATGGE LEPGPHDVPP TPSSRDDVPA
RSGRTPGPLG LPSPSVIVPP AGLSSGRPIP GFRGTVGQLT QAKAFTDFVA AHDTQIIFLD
ISTLAEGNEG AFYPGPDFGG DNRPNFTLFD ACGALGPGEP PGFEPGKECF GSTYRLAEMA
HSGASFGYVQ GSYRLRGYFW VDLVPGMHQG FANINLRAVD VRDLPR