Gene Francci3_4487 details


Gene Information

Locus tag: Francci3_4487
Symbol:
ID: 3907463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5358238
End bp: 5359698
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 65%
IMG OID: 637881819
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_483562
Protein GI: 86743162
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
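
The coordinates and lengths listed above appear to be mutually consistent. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5358238
end_bp = 5359698

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1461
print(protein_length)  # 486
```

Both results agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields.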


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.493572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA CGCCGGCACC TATCAAGGCC GAGACCGAGG CGATGATCGC CGAGGTGCTG 
TCTAACTACC CTGCGAAGAC GGCCAAGTTC CGGGCCAAGC ACCTCAAGGC CAACGACCCC
GATGGGTCGA AGGAGTGCGA GGTCAAGTCC AACATCAAAT CCCGCCCGGG GGTCATGACG
ATCCGCGGCT GCGCCTACGC CGGTTCCAAG GGTGTGGTGT GGGGCCCGGT CAAGGACATC
GTCAACATCA GCCACGGCCC GGTCGGCTGT GGCCAGTACT CCTGGGCCAC CCGGCGCAAC
TACGCCCGCG GCCCCTGGGG CGTGAATAAC TTCGCCGCGA TGCAGTTCAC GACCGACTTC
CAGGAGAAGG ACATCGTCTT CGGCGGTGAC CCGAAGCTGG AGACGGTCTG CGACGAGATC
GTCGAGCTGT TCCCGCTGGC CAAGGGGATC TCGGTCCAGT CCGAGTGCCC GATCGGTCTG
ATCGGCGATG ACATCGAGGC GGTTTCCCGC AAGGCCTCGA AGAAGCTCGA GCTGCCGGTC
GTCCCGGTGC GCTGCGAGGG GTTCCGCGGG GTCAGCCAGT CCCTCGGCCA CCACATCGCC
AACGACGCGG TCCGCGACCA CGTGCTGGGT ACCGGCGGGA ACACCTTCAA GCAGACGCCG
TATGACGTCG CGCTCATCGG CGACTACAAC ATCGGCGGCG ACGCCTGGGC CTCCCGGAAG
ATCCTGGAGG AGATGGGCCT GCGGGTCATC GCCCAGTGGT CCGGTGACGG CACGATCAAC
GAGATGGCCT CGACGCACCT GTCGAAGCTG AACTTGATCC ACTGCTACCG GTCGATGAAC
TACATCTGCA CGACCATGGA GGAGCGCTAC GGCACTCCGT GGACCGAGTT CAACTTCTTC
GGTCCTACCA AGATCGTCGA TTCGATGCGG AAGATCGCGG CTTTCTTCGA CGACGAGATC
AAGCAGAAGA CCGAGGCTGC GATCGCCCGC TACCAGGTCC GGTTCGACGA GATCACGAAC
GCCTTCCGGC CGCGCCTGGA AGGCAAGCGG GTCATGCTCG CGGTCGGCGG ACTGCGGCCC
CGGCACACCA TCGGTGCCTA CGAGGACCTC GGCATGGAGG TCGTCGGCAC CGGCTACGAG
TTCGCGCACA AGGACGACTA CACCCGTACT TACCCCGCTC TCGCGGAGGG CGTGGTCCTC
TACGACGACC CGACGGCCTT CGAGCTGGAG GAGTTCGCCA AGCGGCTCAA GCCGGACCTG
ATGGGCGCGG GCGTCAAGGA GAAGTACGTC TTCCACAAGA TGGGCGTCCC CTTCCGCCAG
ATGCACTCCT GGGACTACTC CGGGCCATAC CACGGCGTGG ACGGCTTCGC GGTCTTCGCC
CGTGACATGG ACATCGCGAT CAACAGCCCG ACCTGGGACC TCATGGAGAC CCCCTGGTCG
AAGGCCGGAG AGGTGTTCTG A
 
Protein sequence
MTTTPAPIKA ETEAMIAEVL SNYPAKTAKF RAKHLKANDP DGSKECEVKS NIKSRPGVMT 
IRGCAYAGSK GVVWGPVKDI VNISHGPVGC GQYSWATRRN YARGPWGVNN FAAMQFTTDF
QEKDIVFGGD PKLETVCDEI VELFPLAKGI SVQSECPIGL IGDDIEAVSR KASKKLELPV
VPVRCEGFRG VSQSLGHHIA NDAVRDHVLG TGGNTFKQTP YDVALIGDYN IGGDAWASRK
ILEEMGLRVI AQWSGDGTIN EMASTHLSKL NLIHCYRSMN YICTTMEERY GTPWTEFNFF
GPTKIVDSMR KIAAFFDDEI KQKTEAAIAR YQVRFDEITN AFRPRLEGKR VMLAVGGLRP
RHTIGAYEDL GMEVVGTGYE FAHKDDYTRT YPALAEGVVL YDDPTAFELE EFAKRLKPDL
MGAGVKEKYV FHKMGVPFRQ MHSWDYSGPY HGVDGFAVFA RDMDIAINSP TWDLMETPWS
KAGEVF
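
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, as far as this sketch is concerned). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last lines of the gene sequence are used here.

```python
# Amino acids in NCBI codon order, with bases ordered T, C, A, G.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGACGACGA CGCCGGCACC TATCAAGGCC GAGACCGAGG CGATGATCGC CGAGGTGCTG"
last_line = "AAGGCCGGAG AGGTGTTCTG A"

print(translate(first_line))  # MTTTPAPIKAETEAMIAEVL
print(translate(last_line))   # KAGEVF
```

The two outputs match the first 20 and last 6 residues of the protein sequence; the final codon TGA is a stop codon and contributes no residue.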