Gene Francci3_1083 details

Gene Information

Locus tag: Francci3_1083
Symbol:
ID: 3906426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1292146
End bp: 1293414
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 74%
IMG OID: 637878417
Product: hypothetical protein
Protein accession: YP_480194
Protein GI: 86739794
COG category:
COG ID:
TIGRFAM ID: [TIGR02678] conserved hypothetical protein TIGR02678
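
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1292146, 1293414)
print(length, protein_length_aa(length))  # 1269 422
```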


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000423957
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGG ACTCGGTCGC CGAGGAGAAC CGGGCCGCGC GGCGCAGGGC CCTGCGCGCG 
CTGCTGGCCG GGCCGCTACG CACCGCCGGC GCGGACGACG AGGTCCTGCG GCTGGTCCGC
CGGCACGCGT CCGAGCTGCG CGAGTGGCTC GCCGCCGAAA CCGGCTGGCG GCTGGTCGTC
GACGCCGAAT CGGCGCGGCT GTTCAAGACG GCGGCGACCA TCCAGGACGA CACGCACCCG
GCGCGGGAGG GGAAGGGCCG AGCGCCGTTC GGGCGGCGCC GCTACGTCCT GCTGTGCCTC
GCGCTGTCAG TGCTGGAGGG GGCGGACACC CAGATCACCC TCGGCCGGTT GGCCGAAGGG
GTGCTGGTCG CGGCGAGCGA CCCGGAACTG GCCCGCACCG GGGTCACCTT CACCCTCAGC
CGCCGGGACG AACGTTCCGA CCTCGTGGCG GTGGTCCGGC TGCTGCTCAC GCTCGGCGTG
CTGGATCGGG TCGCCGGCGA GGAGGACGCC TATCTGCGCG ACAGCGGCGA CGCGTTGTAC
GACGTACGCC GCAGGGTGCT CGCGTCGCTG CTGACCGGTA CCCGCGGCCC GTCGACCATC
GACGCCGACG ACATCGGGGC GCGGCTCGCC GAGCTGACCC ACGAGCCCGT CCCCGACACC
GACGACCTGC GCAACCGGTC GCTTCGCCGC CGGTTGACCC GCCGGCTGCT CGATGATCCC
GTCGTCTATT ACGACGAGCT CGCCGAGGAC GAGCGCGCCT ACCTGATCAG CCAGCGCCGG
GCGATCACCA GACGGATCGA GGACGCCACC GGTCTGATCG CCGAGATGCG CGCGGAGGGA
ATCGCGATGG TCGACCCCGA CGACGAGCTC ACCGACGTAC GGATGCCGGA ACAGCGCACC
GACGGCCACG TGACCCTGCT AGTCGCCGAG TATCTCGCCA CCCGGCCGGA TCCCGCGGAG
CCGGTGCCGG TCGGCCGGCT GCGTGGATAC GTTCGGAAGA TGGCGGCCGA GCATTCCACC
TACTGGCGGC GGGGCGTCAC CGAACCGGGT GCCGACGCCG AGCTGCTCGC CATGGCGCTG
GACAAGCTGC GCGCGCTGCG GCTGGTCACG GACGTGCCGG GCCGGGCCGG CGAGCCGCCG
GCAGTGCTCG CCCGGCCCGC GATCGCGCGC TACGCCGTCG AGGCGCCGAC GATCCACGAC
GGCCGGGCGG GCGGCGCCGG CCCGGTCAGA AGCGGTCCGG CCAGAAGGAA GAAGACGAGC
CGCCGATGA
 
Protein sequence
MTADSVAEEN RAARRRALRA LLAGPLRTAG ADDEVLRLVR RHASELREWL AAETGWRLVV 
DAESARLFKT AATIQDDTHP AREGKGRAPF GRRRYVLLCL ALSVLEGADT QITLGRLAEG
VLVAASDPEL ARTGVTFTLS RRDERSDLVA VVRLLLTLGV LDRVAGEEDA YLRDSGDALY
DVRRRVLASL LTGTRGPSTI DADDIGARLA ELTHEPVPDT DDLRNRSLRR RLTRRLLDDP
VVYYDELAED ERAYLISQRR AITRRIEDAT GLIAEMRAEG IAMVDPDDEL TDVRMPEQRT
DGHVTLLVAE YLATRPDPAE PVPVGRLRGY VRKMAAEHST YWRRGVTEPG ADAELLAMAL
DKLRALRLVT DVPGRAGEPP AVLARPAIAR YAVEAPTIHD GRAGGAGPVR SGPARRKKTS
RR
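
The protein sequence is the conceptual translation of the gene sequence under translation table 11. A minimal sketch of that translation, together with a GC-content calculation, is given below. It assumes the sequences are supplied as plain strings (the spaces between 10-character blocks are stripped) and hard-codes the amino-acid assignments of NCBI translation table 11. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino-acid assignments of NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, newlines) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Assumes the sequence starts at an ATG start codon, as this gene does;
    alternative start codons allowed by table 11 are not special-cased.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACGGCGG ACTCGGTCGC CGAGGAGAAC"))  # MTADSVAEEN
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields of this record to be rechecked.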