Gene Francci3_3410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3410 
Symbol 
ID3905650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4050547 
End bp4051671 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content71% 
IMG OID637880733 
Productserine/threonine protein kinase 
Protein accessionYP_482493 
Protein GI86742093 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00988586 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTACGG TCGCCTCTGA TTTGGAGATC GTCGAACGGT TGCGTTACGG GGCGTCCTTC 
GCGTCGGTGC CGCGCACCTG GGGCCGTGGC CGGTACCGCT GCGTGCGTCA TCTCGCCGAG
GGCGGCCAAG GTTACGTCGA GCTCGCCCGG GACGAGTGGA GCAACGCCCT GGTCGTCGTC
AAGGGCGCAT GGTGGGGCGG CCGCGACCAC GACATCAATC CCGAGTACGC CAGGACCCAG
TACGAGAAGC GCTCGATCGA CGTCGAGGAC GCGGTCGCCG TGCAGGCGGC GCTCGGCGAG
ATCACCCACG GGGTGCCGGC CCTGGTGGAC GTCGTCTACG GGCCCTCGCC GACCCGGCAC
GACCATAACG CGCTCGTGAC CGGGGGCAAC GAGCGGGAGG TCGAGCGGTA CAACCGTGAG
GCGTTCATCG TCATGCAGTT CATCGGTGAC ATCGGGCAGA TGGTGCCCAC CACGCTTGAC
TCCCGGGTCA CCGAGTCCGG CCCGCTCAGC GCCCGCCAGG TCGTCGAGCT GGCCGACCAG
ATCAGCGCCA CGCTGGAGGC GATGCACACG ATCCGACCGC AGCGGCTCTA CCAGCACGAG
GAACGGATCA GGGGGTACTG GGTCCACGGC GACGTCAAGC CGGAGAACAT CCTCGTCGCT
GGTGACCCAC CCCGGTACTC GCTCGTCGAC CTGTCGACGG CGGCGATCGT CGAGCCGTCG
GCCAAGGTCA TGCCCACCAC CGCGACCCCG GGCTACGCCC CGCCGGGAGC CGAGCCGCTC
AGCCCGCAGT ACGACCTGCA CTGCCTGGCG GCCACCCTGC TGTTCGCGCT GACCGGTGAC
CGCCCGGATG ACTGGCTCGG CGGGGCGACC GAGGCCAGGT CGGCCGCGGA CGCCGCCGGT
TCCCGGGCTG ATGCCGAGGC GCGGGACGAG AAGCTGCGGC AGCTGCGGGG TGAGCTCGCG
GCCCGTCGCG TGCACCCCAT GCTGATCCGC TTGATCACCG ACTGCCTGCC CGCCGATCCG
CGTTTCCGGC TCGGCACCGC CACCGTGCTA CGCGCGGAGA TCGCCGCGGT GCGTACCGCG
CTGGTCGCCC GCGAGGTCCT GTCGGATGAG GAGCCGCAAC CGTGA
 
Protein sequence
MSTVASDLEI VERLRYGASF ASVPRTWGRG RYRCVRHLAE GGQGYVELAR DEWSNALVVV 
KGAWWGGRDH DINPEYARTQ YEKRSIDVED AVAVQAALGE ITHGVPALVD VVYGPSPTRH
DHNALVTGGN EREVERYNRE AFIVMQFIGD IGQMVPTTLD SRVTESGPLS ARQVVELADQ
ISATLEAMHT IRPQRLYQHE ERIRGYWVHG DVKPENILVA GDPPRYSLVD LSTAAIVEPS
AKVMPTTATP GYAPPGAEPL SPQYDLHCLA ATLLFALTGD RPDDWLGGAT EARSAADAAG
SRADAEARDE KLRQLRGELA ARRVHPMLIR LITDCLPADP RFRLGTATVL RAEIAAVRTA
LVAREVLSDE EPQP