Gene Francci3_3178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3178 
Symbol 
ID3903903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3767555 
End bp3768700 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content70% 
IMG OID637880502 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_482264 
Protein GI86741864 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0982529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.120585 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTG TCACGTCGCC ATCCGGGCAG GTGCCGCCCT CCGGCCCCGG CTGCGCGCCC 
GATGCCACCG ATTCAGCCGT GTCGGCCGCG CAGGACGGCC CACTGCCTGC CGCCGCCTTC
GACCTGCTAC CGGATTCGGT GATCGTGACC GACGCCGCCG GCGTCGTCGA GGTGTTCAAC
CGGGCGGCGG CGAAGCTCAC CGGGGTGGAT GGACCGTCCG CGATCGGCCG GCATCTGACC
GAGGTCCTCC CGTTGCTGGA CGAGCGGGGC AACGACTGGT GGGAGTGCTC TACAGCCAGC
CGGAACCTGC CGCGGGTCAC CGGGCAGCCG GAGCGGCGCC TGACCTACGC GGGTCCAGTC
CACGACCGTG ACTTCCACGT CACGGTGCGG TTCAACCGGG TCGCCGGTCG GCTGGTGCGG
GTGTCGCTGT CCCTGCGCGA CACCCTCAGC CGCGAGCGGC TGGAACGCAA CCGGGCGGAC
CTGGTCGCGA CCGTGGCGCA CGAGCTGCGC TCGCCGCTGA CCAGTGTGAA GGGATTCACC
GCGACGCTGC TCGCCAAGTG GGACCGGTTC ACCGACGAGC AGAAGAAGCT CATGCTGAAC
ACGGTCAACA CCGACGCCGA CCGGGTCACG CGCCTGCTCA CCGAGGTGCT GGACGTCTCT
CGGATCGACT CCGGACGCAT CCAGGTCCGC AAGCAGATCG TCGACCTGCC CGCCCGGGTG
CGTTCGGTGG TGGACGGCAA GGTGGCCTCG GGCGCGGCGG GCGCCGACCG GTTCTTCATC
CGCGAGGAGG GCGAACTGCC CGAGATGTGG GTCGATCCGG ACAAGATTGA ACAGGTACTG
CACAACCTCG TCGACAACGC CCTGCGGCAT GGTGCGGGTA CTGTGACGGT ACTGCTGCGT
GGCAGGGACA GCGGCACGGA GGTAAGCGTG GCCGACGAGG GTGAAGGCGT GTCCGAATCG
AACGCCGCGC GCGTGTTCAC GAAGTTCTGG CGGGGCGCCA GCCGCGGGAA CGGCACCGGC
CTCGGGCTCT ACATCGCCAA GGCGCTGATC GAGGCGCACG GCGGCACCAT CTCGGTCGGC
CGGGCCCCGG GTGGCGGCGC CGAGTTCCGA TTTTTCGTGC CCGCCGGTGG CCCGGTGTTC
GGCTGA
 
Protein sequence
MNIVTSPSGQ VPPSGPGCAP DATDSAVSAA QDGPLPAAAF DLLPDSVIVT DAAGVVEVFN 
RAAAKLTGVD GPSAIGRHLT EVLPLLDERG NDWWECSTAS RNLPRVTGQP ERRLTYAGPV
HDRDFHVTVR FNRVAGRLVR VSLSLRDTLS RERLERNRAD LVATVAHELR SPLTSVKGFT
ATLLAKWDRF TDEQKKLMLN TVNTDADRVT RLLTEVLDVS RIDSGRIQVR KQIVDLPARV
RSVVDGKVAS GAAGADRFFI REEGELPEMW VDPDKIEQVL HNLVDNALRH GAGTVTVLLR
GRDSGTEVSV ADEGEGVSES NAARVFTKFW RGASRGNGTG LGLYIAKALI EAHGGTISVG
RAPGGGAEFR FFVPAGGPVF G